Methods of Diagnosing & Treating Obesity, Diabetes and Insulin Resistance

CROSS-REFERENCE TO RELATED PATENT APPLICATIONS

[01] The present application claims priority to U.S. Provisional Patent
Application No. 60/580,448, filed June 16, 2004, which is incorporated by reference in its
entirety for all purposes.

BACKGROUND OF THE INVENTION 

[02] Obesity has reached epidemic proportions globally, with more than 1
billion adults overweight, at least 300 million of them clinically obese, and is a major
contributor to the global burden of chronic disease and disability. Overweight and obesity
lead to adverse metabolic effects on blood pressure, cholesterol, triglycerides and insulin
resistance. The non-fatal but debilitating health problems associated with obesity include
respiratory difficulties, chronic musculoskeletal problems, skin problems and infertility. The
more life-threatening problems fall into four main areas: cardiovascular disease,
conditions associated with insulin resistance such as Type 2 diabetes, certain types of cancer,
especially the hormonally related and large-bowel cancers, and gall bladder disease. The
likelihood of developing Type 2 diabetes and hypertension rises steeply with increasing body
fatness. Weight reduction leads to correction of a number of obesity-associated endocrine
and metabolic disorders.

[03] Effective weight management for individuals and groups at risk of
developing obesity involves a range of long-term strategies. These include prevention, weight
maintenance, management of co-morbidities and weight loss. Existing treatment strategies
include calorific restriction programs, surgery (gastric stapling) and drug intervention. The
currently available anti-obesity drugs can be divided into two classes: centrally acting and
peripherally acting. Three marketed drugs are Xenical (Orlistat), Meridia (Sibutramine) and
Adipex-P (Phentermine). Xenical is a non-systemic GI lipase inhibitor that is
indicated for short- and long-term obesity management. Meridia reduces food intake by
inhibiting the reuptake of primarily norepinephrine and serotonin. Adipex-P is a phentermine
with sympathomimetic activity that suppresses appetite. It is indicated only for short-term
use. A more drastic route to permanent weight loss is gastric bypass surgery, which
limits absorption of calories through massive reduction in stomach size.

[04] Carrying extra body weight and body fat goes hand in hand with the
development of diabetes. People who are overweight (BMI greater than 25) are at a much
greater risk of developing type 2 diabetes than normal weight individuals. Almost 90% of
people with type 2 diabetes are overweight.

[05] Diabetes mellitus can be divided into two clinical syndromes, Type 1
and Type 2 diabetes mellitus. Type 1, or insulin-dependent diabetes mellitus (IDDM), is a
chronic autoimmune disease characterized by the extensive loss of beta cells in the pancreatic
Islets of Langerhans, which produce insulin. As these cells are progressively destroyed, the
amount of secreted insulin decreases, eventually leading to hyperglycemia (abnormally high
level of glucose in the blood) when the amount of secreted insulin drops below the level
required for euglycemia (normal blood glucose level). Although the exact trigger for this
immune response is not known, patients with IDDM have high levels of antibodies against
proteins expressed in pancreatic beta cells. However, not all patients with high levels of
these antibodies develop IDDM.

[06] Type 2 diabetes (also referred to as non-insulin dependent diabetes 
mellitus (NIDDM)) develops when muscle, fat and liver cells fail to respond normally to 
insulin. This failure to respond (called insulin resistance) may be due to reduced numbers of 
insulin receptors on these cells, or a dysfunction of signaling pathways within the cells, or
both. The beta cells initially compensate for this insulin resistance by increasing insulin 
output. Over time, these cells become unable to produce enough insulin to maintain normal 
glucose levels, indicating progression to Type 2 diabetes. 

[07] Type 2 diabetes is brought on by a combination of genetic and 
acquired risk factors - including a high-fat diet, lack of exercise, and aging. Worldwide,
Type 2 diabetes has become an epidemic, driven by increases in obesity and a sedentary 
lifestyle, widespread adoption of western dietary habits, and the general aging of the 
population in many countries. In 1985, an estimated 30 million people worldwide had 
diabetes - by 2000, this figure had increased 5-fold, to an estimated 154 million people. The 
number of people with diabetes is expected to double between now and 2025, to about 300
million. 

[08] Type 2 diabetes is a complex disease characterized by defects in
glucose and lipid metabolism. Typically there are perturbations in many metabolic
parameters, including increases in fasting plasma glucose levels, free fatty acid levels and
triglyceride levels, as well as a decrease in the ratio of HDL/LDL. As discussed above, one
of the principal underlying causes of diabetes is thought to be an increase in insulin resistance
in peripheral tissues, principally muscle and fat.

[09] Therapies aimed at reducing peripheral insulin resistance are available.
The most relevant to this invention are drugs of the thiazolidinedione (TZD) class, namely
troglitazone, rosiglitazone, and pioglitazone. In the US these have been marketed under the
names Rezulin™, Avandia™ and Actos™, respectively. The principal effect of these drugs is
to improve glucose homeostasis. Notably, in diabetics treated with TZDs there are increases
in peripheral glucose disposal rates indicative of increased insulin sensitivity in both muscle
and fat.

[10] The molecular target of TZDs is a member of the PPAR family of 
ligand-activated transcription factors called PPAR gamma. This transcription factor is highly 
expressed in adipose tissue with much lower levels being observed in muscle. Binding of 
TZDs to PPAR gamma in target cells and tissues such as fat and muscle brings about a 
change in gene expression. The link between TZD-altered gene expression in fat and muscle
and increased insulin sensitivity is unknown. The present invention addresses this and other 
problems. 

BRIEF SUMMARY OF THE INVENTION

The present invention provides methods for identifying an agent for treating an obese,
diabetic or pre-diabetic individual. In some embodiments, the method comprises the steps
of: (i) contacting an agent to a polypeptide encoded by a polynucleotide that is substantially
identical to or hybridizes to a nucleic acid encoding a polypeptide listed in Table 1 under
hybridization conditions of 50% formamide, 5X SSC, and 1% SDS at 42°C followed by a
wash in 0.2X SSC, and 0.1% SDS at 55°C, wherein the polypeptide optionally has the
activity listed in Table 1; and (ii) selecting an agent that modulates the expression or activity
of the polypeptide or that binds to the polypeptide, thereby identifying an agent for treating
an obese, diabetic or pre-diabetic individual.

Table 1: List of Polypeptides, SEQ ID Numbers and Proposed Activity

PROTEIN NAME | SEQ ID NO: | PROPOSED ACTIVITY(S)
Adlican | 2, 4, 6 | Signal Transduction
ALDH1A3 | 8, 10, 12 | Aldehyde dehydrogenase
ALK7 | 14, 16, 18 | Receptor Serine/threonine protein kinase
C3AR1 | 20, 22, 24 | G-protein coupled receptor for Complement Component C3a
CALCRL | 26, 28, 30 | G-protein coupled receptor for adrenomedullin or calcitonin gene-related protein
CCL13 | 32, 33, 35, 37 | Chemokine
CCL8 | 39, 40, 42, 44 | Chemokine
CHBL1 | 46, 47, 49, 51 | Glycosyl hydrolase
CR1 | 53, 55 | Transmembrane receptor for Complement Component C3b-C4b
CSFR1 | 57, 59, 61 | Receptor tyrosine kinase
CTSK | 63, 64, 66, 68 | Cysteine protease
CXCR4 | 70, 72, 74 | G-protein coupled receptor for CXCL12
DDAH2 | 76, 78, 80 | Amidinotransferase
DERP7 | 82, 84, 86 | 7 transmembrane protein
ENDOGLYX1 | 88, 90, 92 | Modifier of extracellular matrix
ETL | 94, 96, 98 | G-protein coupled receptor
FU12389 | 100, 102, 104, 106, 108 | Acetoacetate CoA ligase
FZD4 | 110, 112, 114, 116 | 7 Transmembrane receptor for Wnt proteins
GLIPR1 | 118, 120, 122 | Apoptosis Regulator
GPR105 | 124, 126, 128 | G-protein coupled receptor for UDP sugars
GPR146 | 130, 132, 134 | G-protein coupled receptor
GPR30 | 136, 138, 140 | G-protein coupled receptor
GPR65 | 142, 144, 146 | G-protein coupled receptor for psychosine
HTR2B | 148, 150, 152 | G-protein coupled receptor for serotonin
ITGB2 | 154, 156, 158 | Cell Adhesion
ITIH5 | 160, 161, 163 | Extracellular matrix stabilization
LGALS12 | 165, 167, 169, 171, 173, 175, 177, 179 | Apoptosis Regulator
NMB | 181, 182, 184, 186 | Ligand for the G-protein coupled receptor, NMBR
NNAT | 188, 190, 192, 194 | Regulator of ion channels
OLFM2 | 196, 197, 199, 201 | Secreted protein
OPN3 | 203, 205, 207, 209, 211 | G-protein coupled receptor
PTPRE | 213, 215, 217, 219, 221 | Protein tyrosine phosphatase
RDC1 | 223, 225, 227 | G-protein coupled receptor
SLIT2 | 229, 230, 232, 234 | Ligand for roundabout receptor, ROBO1
TNFRSF21 | 236, 238, 240 | Transmembrane receptor
TNFSF13B | 242, 243, 245, 247, 249 | Ligand for TNFRSF13B, TNFRSF13C and TNFRSF17B
TNFSF14 | 251, 252, 254, 256, 258, 260 | Ligand for TNFRSF14
TPSB2 | 262, 263, 265, 267 | Serine protease
WISP2 | 269, 270, 272, 274 | Growth Regulator

In some embodiments, the polypeptide comprises an amino acid sequence at least 95% 
identical to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 
39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124,
126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 161, 
163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 196, 197, 
199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 230, 232, 234, 
236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 267, 269,
270, 272, 274 or a protein domain thereof. In some embodiments, the method further
comprises detecting whether the selected agent modulates weight and/or obesity. In some
embodiments, the method further comprises detecting whether the selected agent modulates
insulin sensitivity.
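
The "at least 95% identical" criterion can be illustrated with a minimal sketch. It assumes the two amino acid sequences have already been aligned to equal length with "-" as the gap character, and it counts identity over alignment columns; the function names, the gap symbol and this denominator convention are illustrative assumptions, not definitions taken from this application.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns holding the same residue in both sequences.

    Assumes the inputs are pre-aligned; columns that are gaps in both
    sequences are ignored (an illustrative convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("no alignment columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 95.0) -> bool:
    """True when the aligned pair reaches the stated percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Other identity conventions (for example, dividing by the length of the shorter sequence) give different percentages for the same pair, so the convention used should be stated alongside any threshold.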

[11] In some embodiments, step (ii) comprises selecting an agent that
modulates expression of the polypeptide. In some embodiments, step (ii) comprises selecting
an agent that modulates the activity of the polypeptide. In some embodiments, step (ii)
comprises selecting an agent that specifically binds to the polypeptide.

[12] In some embodiments, the polypeptide is expressed in a cell and the 
cell is contacted with the agent. In some embodiments, the polypeptide comprises SEQ ID
NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 46, 47,
49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 
98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 
136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 161, 163, 165, 167, 169, 171,
173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 196, 197, 199, 201, 203, 205, 207,
209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 230, 232, 234, 236, 238, 240, 242, 243,
245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 267, 269, 270, 272 or 274. 

[13] The present invention also provides methods of reducing body weight
in an animal. In some embodiments, the methods comprise administering to the animal an
effective amount of an agent that modulates the activity or expression of SEQ ID NO: 2, 4, 6, 
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53, 
55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 
102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 161, 163, 165, 167, 169, 171, 173, 175,
177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 196, 197, 199, 201, 203, 205, 207, 209, 211, 
213, 215, 217, 219, 221, 223, 225, 227, 229, 230, 232, 234, 236, 238, 240, 242, 243, 245, 247, 
249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 267, 269, 270, 272 or 274. 

[14] In some embodiments, the agent is selected by a method comprising (i)
contacting an agent to a mixture comprising a polypeptide encoded by a polynucleotide that is
substantially identical to or hybridizes to a nucleic acid encoding a polypeptide listed in Table
1 under hybridization conditions of 50% formamide, 5X SSC, and 1% SDS at 42°C followed
by a wash in 0.2X SSC, and 0.1% SDS at 55°C, wherein the polypeptide optionally has the
activity listed in Table 1; and (ii) selecting an agent that modulates the expression or activity
of the polypeptide or that binds to the polypeptide.

[15] In some embodiments, the agent is an antibody. In some embodiments,
the antibody is a monoclonal antibody. In some embodiments, the animal is a human. 

[16] The present invention also provides methods of treating a diabetic or
pre-diabetic animal. In some embodiments, the method comprises administering to the
animal a therapeutically effective amount of an agent that modulates the activity or
expression of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 
39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124,
126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 161, 
163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 196, 197,
199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 230, 232, 234, 
236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 267, 269, 
270, 272 or 274. In some embodiments, the polypeptide comprises an amino acid sequence 
at least 95% identical to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32,
33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78,
80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120,
122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 
160, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 
196, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 230, 
232, 234, 236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 
267, 269, 270, 272, 274 or a protein domain thereof. 

[17] In some embodiments, the agent is selected by a method comprising
(i) contacting an agent to a mixture comprising a polypeptide encoded by a polynucleotide
that hybridizes to a nucleic acid encoding SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22,
24, 26, 28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68,
70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112,
114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 
152, 154, 156, 158, 160, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 
188, 190, 192, 194, 196, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 
225, 227, 229, 230, 232, 234, 236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 
260, 262, 263, 265, 267, 269, 270, 272 or 274 in 50% formamide, 5X SSC, and 1% SDS at 
42°C followed by a wash in 0.2X SSC, and 0.1% SDS at 55°C; and (ii) selecting an agent that 
modulates the expression or activity of the polypeptide or that binds to the polypeptide. 

[18] In some embodiments, the agent is an antibody. In some embodiments, 
the antibody is a monoclonal antibody. In some embodiments, the animal is a human. 

[19] The present invention also provides methods of introducing an 
expression cassette into a cell. In some embodiments, the methods comprise introducing into 
the cell an expression cassette comprising a promoter operably linked to a polynucleotide 
encoding a polypeptide, wherein the polynucleotide is substantially identical to or hybridizes 
to a nucleic acid encoding a polypeptide listed in Table 1 under hybridization conditions of 
50% formamide, 5X SSC, and 1% SDS at 42°C followed by a wash in 0.2X SSC, and 0.1% 
SDS at 55°C, and the polypeptide optionally has the activity listed in Table 1. In some
embodiments, the polypeptide comprises an amino acid sequence at least 95% identical to 
SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 
46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 
94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130,
132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 161, 163, 165, 167, 
169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 196, 197, 199, 201, 203,
205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 230, 232, 234, 236, 238, 240,
242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 267, 269, 270, 272, 274 
or a protein domain thereof. 

[20] In some embodiments, the polypeptide comprises SEQ ID NO: 2, 4, 6,
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53,
55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 
102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 161, 163, 165, 167, 169, 171, 173, 175, 
177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 196, 197, 199, 201, 203, 205, 207, 209, 211,
213, 215, 217, 219, 221, 223, 225, 227, 229, 230, 232, 234, 236, 238, 240, 242, 243, 245, 247,
249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 267, 269, 270, 272 or 274. 

[21] In some embodiments, the cell is selected from the group consisting of 
an adipocyte and a skeletal muscle cell. 

[22] In some embodiments, the method further comprises introducing the 
cell into a human. In some embodiments, the human is obese. In some embodiments, the
human is diabetic. In some embodiments, the human is prediabetic. In some embodiments, 
the cell is from the human. 

[23] The present invention also provides methods of diagnosing an 
individual who has obesity, Type 2 diabetes or has a predisposition for diabetes or obesity. In 
some embodiments, the method comprises detecting in a sample from the individual the level
of a polypeptide or the level of a polynucleotide encoding the polypeptide, wherein the 
polynucleotide is substantially identical to or hybridizes to a nucleic acid encoding a 
polypeptide listed in Table 1 under hybridization conditions of 50% formamide, 5X SSC, and 
1% SDS at 42°C followed by a wash in 0.2X SSC, and 0.1% SDS at 55°C, wherein a 
modulated level of the polypeptide or polynucleotide in the sample compared to a level of the
polypeptide or polynucleotide in either a lean individual or a previous sample from the 
individual indicates that the individual is obese or diabetic or has a predisposition for diabetes 
or obesity. 

[24] In some embodiments, the detecting step comprises contacting the 
sample with an antibody that specifically binds to the polypeptide. In some embodiments, the
amino acid sequence comprises SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 
76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118,
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156,
158, 160, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192,
194, 196, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229,
230, 232, 234, 236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 
265, 267, 269, 270, 272 or 274. In some embodiments, the detecting step comprises 
quantifying mRNA encoding the polypeptide. In some embodiments, the mRNA is reverse
transcribed and amplified in a polymerase chain reaction. In some embodiments, the sample 
is a blood, urine or tissue sample. 

[25] The present invention provides for an isolated nucleic acid that is 
substantially identical to or hybridizes to a nucleic acid encoding a polypeptide listed in Table
1 under hybridization conditions of 50% formamide, 5X SSC, and 1% SDS at 42°C followed
by a wash in 0.2X SSC, and 0.1% SDS at 55°C. In some embodiments, the polynucleotide
comprises SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 34, 36, 38, 41, 
43, 45, 48, 50, 52, 54, 56, 58, 60, 62, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 
95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131,
133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 162, 164, 166, 168, 170,
172, 174, 176, 178, 180, 183, 185, 187, 189, 191, 193, 195, 198, 200, 202, 204, 206, 208, 210, 
212, 214, 216, 218, 220, 222, 224, 226, 228, 231, 233, 235, 237, 239, 241, 244, 246, 248, 250, 
253, 255, 257, 259, 261, 264, 266, 268, 271 or 273. In some embodiments, the polynucleotide 
encodes SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 39,
40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86,
88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126,
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 161, 163, 
165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 196, 197, 199, 
201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 230, 232, 234, 236,
238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 267, 269, 270,
272 or 274. 

[26] The present invention also provides expression cassettes comprising a
heterologous promoter operably linked to a nucleic acid that is substantially identical to or
hybridizes to a nucleic acid encoding a polypeptide listed in Table 1 under hybridization
conditions of 50% formamide, 5X SSC, and 1% SDS at 42°C followed by a wash in 0.2X
SSC, and 0.1% SDS at 55°C.

[27] The present invention also provides host cells transfected with a nucleic
acid that is substantially identical to or hybridizes to a nucleic acid encoding a polypeptide
listed in Table 1 under hybridization conditions of 50% formamide, 5X SSC, and 1% SDS at
42°C followed by a wash in 0.2X SSC, and 0.1% SDS at 55°C. In some embodiments, the
host cell is a human cell. In some embodiments, the host cell is a bacterium.

[28] The present invention also provides isolated polypeptides comprising 
an amino acid sequence at least 70% identical to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20,
22, 24, 26, 28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66,
68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112,
114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 
152, 154, 156, 158, 160, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 
188, 190, 192, 194, 196, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223,
225, 227, 229, 230, 232, 234, 236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258,
260, 262, 263, 265, 267, 269, 270, 272, 274 or fragments thereof. In some embodiments, the 
polypeptide comprises SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 
80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120,
122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158,
160, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192, 194,
196, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 230, 
232, 234, 236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 
267, 269, 270, 272 or 274. 

[29] The present invention also provides antibodies that specifically bind to
a polypeptide selected from the group consisting of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18,
20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 
66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110,
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148,
150, 152, 154, 156, 158, 160, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184,
186, 188, 190, 192, 194, 196, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221,
223, 225, 227, 229, 230, 232, 234, 236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 
258, 260, 262, 263, 265, 267, 269, 270, 272 or 274. 

[30] The present invention also provides pharmaceutical compositions
comprising polypeptides comprising an amino acid sequence at least 70% identical to SEQ
ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 46,
47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 
96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 
134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 161, 163, 165, 167, 169,
171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 196, 197, 199, 201, 203, 205,
207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 230, 232, 234, 236, 238, 240, 242, 
243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 267, 269, 270, 272, 274 or 
fragments thereof, and a pharmaceutically-acceptable excipient. 

DEFINITIONS 

[31] "Insulin sensitivity" refers to the ability of a cell or tissue to respond to 
insulin. Responses include, e.g., glucose uptake of a cell or tissue in response to insulin 
stimulation. Sensitivity can be determined at an organismal, tissue or cellular level. For 
example, blood or urine glucose levels following a glucose tolerance test are indicative of 
insulin sensitivity. Other methods of measuring insulin sensitivity include, e.g., measuring 
glucose uptake (see, e.g., Garcia de Herreros, A., and Birnbaum, M. J. J. Biol. Chem. 264,
19994-19999 (1989); Klip, A., Li, G., and Logan, W. J. Am. J. Physiol. 247, E291-296
(1984)), measuring the glucose infusion rate (GINF) into tissue such as the skeletal muscle
(see, e.g., Ludvik et al., J. Clin. Invest. 100:2354 (1997); Frias et al., Diabetes Care 23:64,
(2000)) and measuring sensitivity of GLUT4 translocation (e.g., as described herein) in 
response to insulin. 

[32] As used herein, an "overweight" person has a body mass index (BMI) ≥
25 and an "obese" person has a BMI ≥ 30. BMI is calculated as the weight in kilograms
divided by the square of the height in meters.
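
The BMI definition above can be written out as a short worked sketch. The function names and the "not overweight" label are illustrative assumptions; the thresholds of 25 and 30 are read as "greater than or equal to", consistent with the overweight criterion stated later among the diabetes risk factors.

```python
def body_mass_index(weight_kg: float, height_m: float) -> float:
    """BMI = weight in kilograms divided by the square of height in meters."""
    if weight_kg <= 0 or height_m <= 0:
        raise ValueError("weight and height must be positive")
    return weight_kg / height_m ** 2


def weight_category(bmi: float) -> str:
    """Map a BMI value onto the categories defined in the text."""
    if bmi >= 30:
        return "obese"
    if bmi >= 25:
        return "overweight"
    return "not overweight"  # label assumed for illustration
```

Under this sketch an obese person also satisfies the overweight criterion; the function simply reports the more specific category.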

[33] The term "waist-to-hip ratio" or "WHR" refers to the ratio of a person's waist
circumference to hip circumference. For most people, carrying extra weight around their 
middle increases health risks more than carrying extra weight around their hips or thighs. For 
both men and women, a waist-to-hip ratio of 1.0 or higher is considered "at risk" or in the 
danger zone for undesirable health consequences, such as heart disease and other ailments 
connected with being overweight. 
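
As a minimal sketch of the WHR definition, the ratio and the "at risk" cutoff of 1.0 stated above can be computed as follows. The function names are illustrative, and both circumferences are assumed to be given in the same unit.

```python
def waist_to_hip_ratio(waist: float, hip: float) -> float:
    """Ratio of waist circumference to hip circumference (same units)."""
    if waist <= 0 or hip <= 0:
        raise ValueError("circumferences must be positive")
    return waist / hip


def is_at_risk(whr: float, cutoff: float = 1.0) -> bool:
    """True when the ratio is at or above the cutoff given in the text."""
    return whr >= cutoff
```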

[34] The term "adipogenic," when used in reference to cells refers to a cell 
which can become an adipocyte. An "adipogenic factor" refers to a factor (including, e.g., a 
protein (or glycoprotein)) that can induce or stimulate the differentiation of cells into an 
adipocyte. Exemplary adipogenic factors include, e.g., Wnt10b, Pref-1, ADF and TNF-alpha.

[35] The term "lipid metabolism" refers to the in vivo process of catabolism
(decomposition) and anabolism (accumulation) of lipids (e.g., triglycerides derived from 
food) and is intended to include, in the broad sense, reactions for transforming lipids into
energy, biosynthesis of fatty acids, acylglycerol, phospholipid metabolism and cholesterol
metabolism. 

[36] "Activity" of a polypeptide of the invention refers to structural, 
regulatory, or biochemical functions of a polypeptide in its native cell or tissue. Examples of 
activity of a polypeptide include both direct activities and indirect activities. Exemplary
direct activities are the result of direct interaction with the polypeptide, e.g., enzymatic
activity, ligand binding, production or depletion of second messengers (e.g., cAMP, cGMP,
IP3, DAG, or Ca2+), ion flux, phosphorylation levels, transcription levels, and the like.
Exemplary indirect activities are observed as a change in phenotype or response in a cell or
tissue to a polypeptide's direct activity, e.g., loss of body weight or molecular events
associated with loss of body weight or obesity or modulating insulin sensitivity of a cell as a
result of the interaction of the polypeptide with other cellular or tissue components. 

[37] "Predisposition for diabetes" occurs in a person when the person is at
high risk for developing diabetes. A number of risk factors are known to those of skill in the
art and include: genetic factors (e.g., carrying alleles that result in a higher occurrence of
diabetes than in the average population or having parents or siblings with diabetes);
overweight (e.g., body mass index (BMI) greater than or equal to 25 kg/m2); habitual physical
inactivity; race/ethnicity (e.g., African-American, Hispanic-American, Native American,
Asian-American, Pacific Islander); previously identified impaired fasting glucose or
impaired glucose tolerance; hypertension (e.g., greater than or equal to 140/90 mmHg in adults);
HDL cholesterol less than or equal to 35 mg/dl; triglyceride levels greater than or equal to 250
mg/dl; a history of gestational diabetes or delivery of a baby over nine pounds; and/or
polycystic ovary syndrome. See, e.g., "Report of the Expert Committee on the Diagnosis and
Classification of Diabetes Mellitus" and "Screening for Diabetes" Diabetes Care 25(1): S5-S24 (2002).

[38] A "lean individual," when used to compare with a sample from a
patient, refers to an adult with a fasting blood glucose level less than 100 mg/dl or a 2 hour
PG reading of 140 mg/dl. "Fasting" refers to no caloric intake for at least 8 hours. A "2 hour
PG" refers to the level of blood glucose after challenging a patient to a glucose load
containing the equivalent of 75 g anhydrous glucose dissolved in water. The overall test is
generally referred to as an oral glucose tolerance test (OGTT). See, e.g., Diabetes Care
26(11): 3160-3167 (2003). The level of a polypeptide in a lean individual can be a reading
from a single individual, but is typically a statistically relevant average from a group of lean
individuals. The level of a polypeptide in a lean individual can be represented by a value, for
example in a computer program.

[39] A "pre-diabetic individual," when used to compare with a sample from
a patient, refers to an adult with a fasting blood glucose level greater than 100 mg/dl but less
than 126 mg/dl or a 2 hour PG reading of greater than 140 mg/dl but less than 200 mg/dl. A
"diabetic individual," when used to compare with a sample from a patient, refers to an adult
with a fasting blood glucose level greater than 126 mg/dl or a 2 hour PG reading of greater
than 200 mg/dl.
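
As a minimal sketch, the glucose thresholds in the lean, pre-diabetic and diabetic definitions can be expressed as a small classifier. The function name, the category labels and the handling of boundary readings are assumptions made for illustration only. In particular, the lean 2 hour PG value given as "140 mg/dl" is read here as "at most 140 mg/dl", and readings that fall exactly on 100, 126 or 200 mg/dl, which the definitions do not cover, are reported as unclassified.

```python
from typing import Optional


def classify_reference_individual(
    fasting_mg_dl: Optional[float] = None,
    two_hour_pg_mg_dl: Optional[float] = None,
) -> str:
    """Hypothetical helper: label a reference individual from glucose readings.

    Thresholds follow the definitions in the text; boundary handling is an
    assumption of this sketch, not part of the specification.
    """
    f, p = fasting_mg_dl, two_hour_pg_mg_dl
    if f is None and p is None:
        raise ValueError("at least one glucose reading is required")

    # The most severe category that either reading satisfies takes precedence.
    if (f is not None and f > 126) or (p is not None and p > 200):
        return "diabetic"
    if (f is not None and 100 < f < 126) or (p is not None and 140 < p < 200):
        return "pre-diabetic"
    # Assumption: a 2 hour PG "of 140 mg/dl" is treated as "at most 140 mg/dl".
    if (f is not None and f < 100) or (p is not None and p <= 140):
        return "lean"
    # Readings exactly on an uncovered boundary (e.g., fasting 100 or 126 mg/dl).
    return "unclassified"
```

For example, a fasting reading of 110 mg/dl alone yields "pre-diabetic", while a fasting reading of exactly 126 mg/dl yields "unclassified" because the text defines diabetic as greater than 126 mg/dl.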

[40] An "agonist" refers to an agent that binds to, stimulates, increases,
activates, facilitates, enhances activation, sensitizes or up regulates the activity or expression
of a polypeptide of the invention.

[41] An "antagonist" refers to an agent that binds to, partially or totally
blocks stimulation, decreases, prevents, delays activation, inactivates, desensitizes, or down 
regulates the activity or expression of a polypeptide of the invention. 

[42] "Antibody" refers to a polypeptide substantially encoded by an
immunoglobulin gene or immunoglobulin genes, or fragments thereof which specifically bind
and recognize an analyte (antigen). The recognized immunoglobulin genes include the kappa,
lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as the myriad
immunoglobulin variable region genes. Light chains are classified as either kappa or lambda.
Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the
immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively.

[43] An exemplary immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus
of each chain defines a variable region of about 100 to 110 or more amino acids primarily
responsible for antigen recognition. The terms variable light chain (VL) and variable heavy
chain (VH) refer to these light and heavy chains respectively.

[44] Antibodies exist, e.g., as intact immunoglobulins or as a number of 
well-characterized fragments produced by digestion with various peptidases. Thus, for 

example, pepsin digests an antibody below the disulfide linkages in the hinge region to
produce F(ab)'2, a dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide
bond. The F(ab)'2 may be reduced under mild conditions to break the disulfide linkage in the
hinge region, thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer
is essentially an Fab with part of the hinge region (see, Paul (Ed.) Fundamental Immunology,
Third Edition, Raven Press, NY (1993)). While various antibody fragments are defined in
terms of the digestion of an intact antibody, one of skill will appreciate that such fragments
may be synthesized de novo either chemically or by utilizing recombinant DNA methodology.
Thus, the term antibody, as used herein, also includes antibody fragments either produced by
the modification of whole antibodies or those synthesized de novo using recombinant DNA
methodologies (e.g., single chain Fv).

[45] The terms "peptidomimetic" and "mimetic" refer to a synthetic
chemical compound that has substantially the same structural and functional characteristics of
the antagonists or agonists of the invention. Peptide analogs are commonly used in the
pharmaceutical industry as non-peptide drugs with properties analogous to those of the
template peptide. These types of non-peptide compound are termed "peptide mimetics" or
"peptidomimetics" (Fauchere, J. Adv. Drug Res. 15:29 (1986); Veber and Freidinger TINS p.
392 (1985); and Evans et al. J. Med. Chem. 30:1229 (1987), which are incorporated herein by
reference). Peptide mimetics that are structurally similar to therapeutically useful peptides
may be used to produce an equivalent or enhanced therapeutic or prophylactic effect.
Generally, peptidomimetics are structurally similar to a paradigm polypeptide (i.e., a
polypeptide that has a biological or pharmacological activity), such as a polypeptide
exemplified in this application, but have one or more peptide linkages optionally replaced by
a linkage selected from the group consisting of, e.g., -CH2NH-, -CH2S-, -CH2-CH2-,
-CH=CH- (cis and trans), -COCH2-, -CH(OH)CH2-, and -CH2SO-. The mimetic can be
either entirely composed of synthetic, non-natural analogues of amino acids, or a chimeric
molecule of partly natural peptide amino acids and partly non-natural analogs of amino acids.
The mimetic can also incorporate any amount of natural amino acid conservative
substitutions as long as such substitutions also do not substantially alter the mimetic's
structure and/or activity. For example, a mimetic composition is within the scope of the
invention if it is capable of carrying out the binding or other activities of an agonist or
antagonist of a polypeptide of the invention.

[46] The term "gene" means the segment of DNA involved in producing a
polypeptide chain; it includes regions preceding and following the coding region (leader and
trailer) as well as intervening sequences (introns) between individual coding segments
(exons).

[47] The term "isolated," when applied to a nucleic acid or protein, denotes 
that the nucleic acid or protein is essentially free of other cellular components with which it is 
associated in the natural state. It may be in a homogeneous state although it can be in either a
dry or aqueous solution. Purity and homogeneity are typically determined using analytical
chemistry techniques such as polyacrylamide gel electrophoresis or high performance liquid
chromatography. A protein that is the predominant species present in a preparation is
substantially purified. In particular, an isolated gene is separated from open reading frames
that flank the gene and encode a protein other than the gene of interest. The term "purified"
denotes that a nucleic acid or protein gives rise to essentially one band in an electrophoretic
gel. Particularly, it means that the nucleic acid or protein is at least 85% pure, more
preferably at least 95% pure, and most preferably at least 99% pure.

[48] The term "nucleic acid" or "polynucleotide" refers to
deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded
form. Unless specifically limited, the term encompasses nucleic acids containing
known analogues of natural nucleotides that have similar binding properties as the reference
nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides.
Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses
conservatively modified variants thereof (e.g., degenerate codon substitutions) and
complementary sequences as well as the sequence explicitly indicated. Specifically,
degenerate codon substitutions may be achieved by generating sequences in which the third
position of one or more selected (or all) codons is substituted with mixed-base and/or
deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol.
Chem. 260:2605-2608 (1985); and Cassol et al. (1992); Rossolini et al., Mol. Cell. Probes
8:91-98 (1994)). The term nucleic acid is used interchangeably with gene, cDNA, and
mRNA encoded by a gene.

[49] The terms "polypeptide," "peptide" and "protein" are used
interchangeably herein to refer to a polymer of amino acid residues. The terms apply to
amino acid polymers in which one or more amino acid residue is an artificial chemical
mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring
amino acid polymers and non-naturally occurring amino acid polymers. As used herein, the
terms encompass amino acid chains of any length, including full-length proteins (i.e.,
antigens), wherein the amino acid residues are linked by covalent peptide bonds.

[50] The term "amino acid" refers to naturally occurring and synthetic
amino acids, as well as amino acid analogs and amino acid mimetics that function in a
manner similar to the naturally occurring amino acids. Naturally occurring amino acids are
those encoded by the genetic code, as well as those amino acids that are later modified, e.g.,
hydroxyproline, γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to
compounds that have the same basic chemical structure as a naturally occurring amino acid,
i.e., an α carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R
group, e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium.
Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but
retain the same basic chemical structure as a naturally occurring amino acid. "Amino acid
mimetics" refers to chemical compounds that have a structure that is different from the
general chemical structure of an amino acid, but which functions in a manner similar to a
naturally occurring amino acid.

[51] Amino acids may be referred to herein by either the commonly known
three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB
Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their
commonly accepted single-letter codes. 

[52] "Conservatively modified variants" applies to both amino acid and 
nucleic acid sequences. With respect to particular nucleic acid sequences, "conservatively
modified variants" refers to those nucleic acids that encode identical or essentially identical
amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical sequences. Because of the degeneracy of the genetic code, a large 
number of functionally identical nucleic acids encode any given protein. For instance, the 
codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every
position where an alanine is specified by a codon, the codon can be altered to any of the
corresponding codons described without altering the encoded polypeptide. Such nucleic acid
variations are "silent variations," which are one species of conservatively modified variations. 
Every nucleic acid sequence herein that encodes a polypeptide also describes every possible 
silent variation of the nucleic acid. One of skill will recognize that each codon in a nucleic
acid (except AUG, which is ordinarily the only codon for methionine, and TGG, which is
ordinarily the only codon for tryptophan) can be modified to yield a functionally identical 
molecule. Accordingly, each silent variation of a nucleic acid that encodes a polypeptide is 
implicit in each described sequence. 
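
The degeneracy argument above can be illustrated with a short sketch that lists the synonymous codons for an amino acid and counts how many distinct coding sequences encode a given peptide. It assumes the standard genetic code written in RNA letters; the table construction and the function names are illustrative and do not come from the specification.

```python
from itertools import product
from math import prod

# Standard genetic code, laid out in the conventional U, C, A, G order for each
# codon position ('*' marks a stop codon). Illustrative assumption of this sketch.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_codons(amino_acid: str) -> list:
    """Return all codons that encode the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


def count_coding_sequences(peptide: str) -> int:
    """Count distinct coding sequences (the original plus all silent variants)."""
    return prod(len(synonymous_codons(aa)) for aa in peptide)
```

Under these assumptions, `synonymous_codons("A")` returns the four alanine codons named in the text, while methionine and tryptophan each return a single codon, so a peptide made only of those two residues has no silent variants.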

[53] As to amino acid sequences, one of skill will recognize that individual
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein
sequence which alters, adds or deletes a single amino acid or a small percentage of amino
acids in the encoded sequence is a "conservatively modified variant" where the alteration
results in the substitution of an amino acid with a chemically similar amino acid.
Conservative substitution tables providing functionally similar amino acids are well known in
the art. Such conservatively modified variants are in addition to and do not exclude
polymorphic variants, interspecies homologs, and alleles of the invention.

[54] The following eight groups each contain amino acids that are
conservative substitutions for one another:

1) Alanine (A), Glycine (G);
2) Aspartic acid (D), Glutamic acid (E);
3) Asparagine (N), Glutamine (Q);
4) Arginine (R), Lysine (K);
5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V);
6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W);
7) Serine (S), Threonine (T); and
8) Cysteine (C), Methionine (M)

(see, e.g., Creighton, Proteins (1984)).
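
The eight groups listed above can be encoded directly as sets, which gives a simple check of whether a single-residue change counts as a conservative substitution. This is only a sketch: the constant and function names are hypothetical, and treating an unchanged residue as "not a substitution" is an assumption of the example.

```python
# The eight conservative substitution groups from the text, as one-letter codes.
CONSERVATIVE_GROUPS = [
    set("AG"),    # 1) Alanine, Glycine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
    set("ST"),    # 7) Serine, Threonine
    set("CM"),    # 8) Cysteine, Methionine
]


def is_conservative_substitution(original: str, replacement: str) -> bool:
    """True if both residues share at least one of the eight groups."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # assumption: an identical residue is not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)
```

Methionine appears in two groups, so it is conservative with respect to both leucine and cysteine, whereas cysteine and leucine share no group and are not conservative with respect to each other.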

[55] "Percentage of sequence identity" is determined by comparing two
optimally aligned sequences over a comparison window, wherein the portion of the
polynucleotide sequence in the comparison window may comprise additions or deletions (i.e.,
gaps) as compared to the reference sequence (e.g., a polypeptide of the invention), which
does not comprise additions or deletions, for optimal alignment of the two sequences. The
percentage is calculated by determining the number of positions at which the identical nucleic
acid base or amino acid residue occurs in both sequences to yield the number of matched
positions, dividing the number of matched positions by the total number of positions in the
window of comparison and multiplying the result by 100 to yield the percentage of sequence
identity.
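
The calculation just described can be sketched as follows, assuming the two sequences have already been optimally aligned and padded with a gap character so that they have equal length. The function name and the choice of "-" as the gap symbol are assumptions for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of sequence identity over an already-aligned comparison window.

    Matched positions are those where both sequences carry the same residue;
    a gap never counts as a match. The denominator is the total number of
    positions in the window, as described in the text.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("comparison window is empty")
    matched = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * matched / len(aligned_a)
```

For a ten-position window such as "ACGT-ACGTA" against "ACGTTACCTA", eight positions match, one is a gap and one is a mismatch, giving 80% identity.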

[56] The terms "identical" or percent "identity," in the context of two or
more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences
that are the same sequences. Sequences are "substantially identical" if two sequences have a
specified percentage of amino acid residues or nucleotides that are the same (i.e., 60%
identity, optionally 65%, 70%, 75%, 80%, 85%, 90%, or 95% identity over a specified region,
or, when not specified, over the entire sequence), when compared and aligned for maximum
correspondence over a comparison window or designated region as measured using one of the
following sequence comparison algorithms or by manual alignment and visual inspection, or
across the entire sequence where not indicated. The invention provides polypeptides or
polynucleotides that are substantially identical to the polypeptides or polynucleotides,
respectively, exemplified herein (e.g., SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14,
15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39,
40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64,
65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89,
90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111,
112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130,
131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 
150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 
169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 
188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 201, 202, 203, 204, 205, 206, 207, 
208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226,
227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 
246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 
265, 266, 267, 268, 269, 270, 271, 272, 273 or 274). This definition also refers to the 
complement of a test sequence. Optionally, the identity exists over a region that is at least 
about 50 nucleotides in length, or more preferably over a region that is 100 to 500 or 1000 or
more nucleotides in length. 

[57] For sequence comparison, typically one sequence acts as a reference
sequence, to which test sequences are compared. When using a sequence comparison
algorithm, test and reference sequences are entered into a computer, subsequence coordinates
are designated, if necessary, and sequence algorithm program parameters are designated.
Default program parameters can be used, or alternative parameters can be designated. The 
sequence comparison algorithm then calculates the percent sequence identities for the test 
sequences relative to the reference sequence, based on the program parameters. 

[58] A "comparison window," as used herein, includes reference to a
segment of any one of the number of contiguous positions selected from the group consisting
of from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in
which a sequence may be compared to a reference sequence of the same number of
contiguous positions after the two sequences are optimally aligned. Methods of alignment of
sequences for comparison are well known in the art. Optimal alignment of sequences for
comparison can be conducted, e.g., by the local homology algorithm of Smith and Waterman
(1981) Adv. Appl. Math. 2:482, by the homology alignment algorithm of Needleman and
Wunsch (1970) J. Mol. Biol. 48:443, by the search for similarity method of Pearson and
Lipman (1988) Proc. Nat'l. Acad. Sci. USA 85:2444, by computerized implementations of
these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics
Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual
alignment and visual inspection (see, e.g., Ausubel et al., Current Protocols in Molecular
Biology (1995 supplement)).
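
To make the idea of a local homology algorithm concrete, the following is a minimal sketch of Smith-Waterman style local alignment scoring. It is not the GAP, BESTFIT or FASTA implementation referred to above: the scoring values, the linear gap penalty and the function name are assumptions chosen for brevity, and only the best local score is returned, not the alignment itself.

```python
def local_alignment_score(
    a: str, b: str, match: int = 2, mismatch: int = -1, gap: int = -2
) -> int:
    """Best local alignment score of a and b (Smith-Waterman, linear gap penalty)."""
    previous_row = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        current_row = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            pair_score = match if a[i - 1] == b[j - 1] else mismatch
            current_row[j] = max(
                0,                               # start a new local alignment
                previous_row[j - 1] + pair_score,  # align a[i-1] with b[j-1]
                previous_row[j] + gap,           # gap in b
                current_row[j - 1] + gap,        # gap in a
            )
            best = max(best, current_row[j])
        previous_row = current_row
    return best
```

Because every cell is floored at zero, the score reflects only the best-matching region, so two sequences that share a four-residue stretch inside otherwise unrelated flanks score the same as the stretch alone.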

[59] Two examples of algorithms that are suitable for determining percent
sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which
are described in Altschul et al. (1997) Nuc. Acids Res. 25:3389-3402, and Altschul et al.
(1990) J. Mol. Biol. 215:403-410, respectively. Software for performing BLAST analyses is
publicly available through the National Center for Biotechnology Information
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring
sequence pairs (HSPs) by identifying short words of length W in the query sequence, which
either match or satisfy some positive-valued threshold score T when aligned with a word of
the same length in a database sequence. T is referred to as the neighborhood word score
threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for
initiating searches to find longer HSPs containing them. The word hits are extended in both
directions along each sequence for as far as the cumulative alignment score can be increased.
Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward
score for a pair of matching residues; always > 0) and N (penalty score for mismatching
residues; always < 0). For amino acid sequences, a scoring matrix is used to calculate the
cumulative score. Extension of the word hits in each direction is halted when: the
cumulative alignment score falls off by the quantity X from its maximum achieved value; the
cumulative score goes to zero or below, due to the accumulation of one or more
negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm
parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN
program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation
(E) of 10, M=5, N=-4 and a comparison of both strands. For amino acid sequences, the
BLASTP program uses as defaults a wordlength of 3, and expectation (E) of 10, and the
BLOSUM62 scoring matrix (see Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA
89:10915) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both
strands.
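
The three halting conditions for word-hit extension can be sketched for the simplest case: an ungapped extension of a nucleotide seed in one direction, scored with M and N. This is a simplified illustration under stated assumptions, not the NCBI implementation; the function name, the argument layout and the example X value are hypothetical.

```python
def extend_right(
    query: str,
    subject: str,
    q_pos: int,
    s_pos: int,
    seed_score: int,
    match: int = 5,      # M, reward for a matching pair (BLASTN default in the text)
    mismatch: int = -4,  # N, penalty for a mismatching pair (BLASTN default in the text)
    x_drop: int = 20,    # X, illustrative value only
) -> tuple:
    """Extend a seed to the right; return (best cumulative score, residues added)."""
    score = best = seed_score
    best_length = 0
    offset = 0
    # Halting condition 3: stop when the end of either sequence is reached.
    while q_pos + offset < len(query) and s_pos + offset < len(subject):
        same = query[q_pos + offset] == subject[s_pos + offset]
        score += match if same else mismatch
        offset += 1
        if score > best:
            best, best_length = score, offset
        # Halting condition 1: score fell off by X from its maximum.
        # Halting condition 2: cumulative score went to zero or below.
        elif best - score >= x_drop or score <= 0:
            break
    return best, best_length
```

As a usage example, a four-base seed scoring 20 that is followed by three further matches and then a run of mismatches is extended by three residues, and the extension stops once the running score has dropped by X from its peak of 35.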

[60] The BLAST algorithm also performs a statistical analysis of the
similarity between two sequences (see, e.g., Karlin and Altschul (1993) Proc. Natl. Acad. Sci.
USA 90:5873-5877). One measure of similarity provided by the BLAST algorithm is the
smallest sum probability (P(N)), which provides an indication of the probability by which a
match between two nucleotide or amino acid sequences would occur by chance. For example,
a nucleic acid is considered similar to a reference sequence if the smallest sum probability in
a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more
preferably less than about 0.01, and most preferably less than about 0.001.

[61] An indication that two nucleic acid sequences or polypeptides are
substantially identical is that the polypeptide encoded by the first nucleic acid is
immunologically cross reactive with the antibodies raised against the polypeptide encoded by
the second nucleic acid, as described below. Thus, a polypeptide is typically substantially
identical to a second polypeptide, for example, where the two peptides differ only by
conservative substitutions. Another indication that two nucleic acid sequences are
substantially identical is that the two molecules or their complements hybridize to each other
under stringent conditions, as described below. Yet another indication that two nucleic acid 
sequences are substantially identical is that the same primers can be used to amplify the 
sequence. 

[62] The phrase "selectively (or specifically) hybridizes to" refers to the
binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence
under stringent hybridization conditions when that sequence is present in a complex mixture 
(e.g., total cellular or library DNA or RNA). 

[63] The phrase "stringent hybridization conditions" refers to conditions
under which a probe will hybridize to its target subsequence, typically in a complex mixture
of nucleic acid, but to no other sequences. Stringent conditions are sequence-dependent and
will be different in different circumstances. Longer sequences hybridize specifically at
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in
Tijssen, Techniques in Biochemistry and Molecular Biology: Hybridization with Nucleic
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays"
(1993). Generally, stringent conditions are selected to be about 5-10°C lower than the
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is
the temperature (under defined ionic strength, pH, and nucleic acid concentration) at which 50%
of the probes complementary to the target hybridize to the target sequence at equilibrium (as
the target sequences are present in excess, at Tm, 50% of the probes are occupied at
equilibrium). Stringent conditions will be those in which the salt concentration is less than
about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to
50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides).
Stringent conditions may also be achieved with the addition of destabilizing agents such as
formamide. For selective or specific hybridization, a positive signal is at least two times
background, optionally 10 times background hybridization. Exemplary stringent
hybridization conditions can be as follows: 50% formamide, 5X SSC, and 1% SDS,
incubating at 42°C, or 5X SSC, 1% SDS, incubating at 65°C, with wash in 0.2X SSC, and
0.1% SDS at 55°C, 60°C, or 65°C. Such washes can be performed for 5, 15, 30, 60, 120, or
more minutes.

[64] Nucleic acids that do not hybridize to each other under stringent
conditions are still substantially identical if the polypeptides that they encode are
substantially identical. This occurs, for example, when a copy of a nucleic acid is created
using the maximum codon degeneracy permitted by the genetic code. In such cases, the
nucleic acids typically hybridize under moderately stringent hybridization conditions.
Exemplary "moderately stringent hybridization conditions" include a hybridization in a
buffer of 40% formamide, 1 M NaCl, 1% SDS at 37°C, and a wash in 1X SSC at 45°C. Such
washes can be performed for 5, 15, 30, 60, 120, or more minutes. A positive hybridization is
at least twice background. Those of ordinary skill will readily recognize that alternative
hybridization and wash conditions can be utilized to provide conditions of similar stringency.

[65] The phrase "a nucleic acid sequence encoding" refers to a nucleic acid
which contains sequence information for a structural RNA such as rRNA, a tRNA, or the
primary amino acid sequence of a specific protein or peptide, or a binding site for a
trans-acting regulatory agent. This phrase specifically encompasses degenerate codons (i.e.,
different codons which encode a single amino acid) of the native sequence or sequences that
may be introduced to conform with codon preference in a specific host cell.

[66] The term "recombinant" when used with reference, e.g., to a cell, or 
nucleic acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has
been modified by the introduction of a heterologous nucleic acid or protein or the alteration
of a native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, 
for example, recombinant cells express genes that are not found within the native 
(nonrecombinant) form of the cell or express native genes that are otherwise abnormally 
expressed, under-expressed or not expressed at all. 

[67] The term "heterologous" when used with reference to portions of a
nucleic acid indicates that the nucleic acid comprises two or more subsequences that are not
found in the same relationship to each other in nature. For instance, the nucleic acid is
typically recombinantly produced, having two or more sequences from unrelated genes
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a
coding region from another source. Similarly, a heterologous protein indicates that the
protein comprises two or more subsequences that are not found in the same relationship to
each other in nature (e.g., a fusion protein).

[68] An "expression vector" is a nucleic acid construct, generated
recombinantly or synthetically, with a series of specified nucleic acid elements that permit
transcription of a particular nucleic acid in a host cell. The expression vector can be part of a
plasmid, virus, or nucleic acid fragment. Typically, the expression vector includes a nucleic
acid to be transcribed operably linked to a promoter.

[69] The phrase "specifically (or selectively) binds to an antibody" or
"specifically (or selectively) immunoreactive with", when referring to a protein or peptide,
refers to a binding reaction which is determinative of the presence of the protein in the
presence of a heterogeneous population of proteins and other biologics. Thus, under
designated immunoassay conditions, the specified antibodies bind to a particular protein and
do not bind in a significant amount to other proteins present in the sample. Specific binding
to an antibody under such conditions may require an antibody that is selected for its
specificity for a particular protein. For example, antibodies raised against a protein having an
amino acid sequence encoded by any of the polynucleotides of the invention can be selected
to obtain antibodies specifically immunoreactive with that protein and not with other proteins,
except for polymorphic variants. A variety of immunoassay formats may be used to select
antibodies specifically immunoreactive with a particular protein. For example, solid-phase
ELISA immunoassays, Western blots, or immunohistochemistry are routinely used to select
monoclonal antibodies specifically immunoreactive with a protein. See, Harlow and Lane
Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, NY (1988) for a
description of immunoassay formats and conditions that can be used to determine specific
immunoreactivity. Typically, a specific or selective reaction will be at least twice the
background signal or noise and more typically more than 10 to 100 times background.

[70] "Inhibitors," "activators," and "modulators" of expression or of
activity are used to refer to inhibitory, activating, or modulating molecules, respectively, of
expression of the polypeptides of the invention as determined using in vitro or in vivo assays
to monitor expression or activity. Modulators encompass e.g., ligands, agonists, antagonists,
their homologs and mimetics, as well as the polypeptides of the invention, or fragments
thereof with antagonist activity or that act to increase overall polypeptide activity (i.e.,
fragments that have at least some of the activity of the full-length protein). In some cases,
fragments of the polypeptides of the invention are at least 20, 50, 75 or 100 amino acids in
length. The term "modulator" includes inhibitors and activators. Inhibitors are agents that,
e.g., inhibit expression of a polypeptide of the invention or bind to, partially or totally block
stimulation, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the
activity of a polypeptide of the invention, e.g., antagonists. Activators are agents that, e.g.,
induce or activate the expression of a polypeptide of the invention or bind to, stimulate,
increase, open, activate, facilitate, or enhance activation, sensitize or up regulate the activity
of a polypeptide of the invention, e.g., agonists. Modulators include naturally occurring and
synthetic ligands, antagonists, agonists, small chemical molecules and the like. Such assays
for inhibitors and activators include, e.g., applying putative modulator compounds to cells
expressing a polypeptide of the invention and then determining the functional effects on a
polypeptide of the invention activity, as described above. Samples or assays comprising a
polypeptide of the invention that are treated with a potential activator, inhibitor, or modulator
are compared to control samples without the inhibitor, activator, or modulator to examine the
extent of effect. Control samples (untreated with modulators) are assigned a relative activity
value of 100%. Inhibition of a polypeptide of the invention is achieved when the polypeptide
activity value relative to the control is about 80%, optionally 50% or 25%, 10%, 5% or 1%.
Activation of the polypeptide is achieved when the polypeptide activity value relative to the 
control is 110%, optionally 150%, optionally 200%, 300%, 400%, 500%, or 1000-3000% or
more higher.
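
By way of illustration only, the comparison to control described above can be written as a short calculation. The following Python sketch assumes a hypothetical function name, `classify_modulator`, and takes the approximately 80% and 110% values recited above as default cutoffs; the input activities are arbitrary readings in any consistent unit and do not represent data from the present application.

```python
def classify_modulator(treated_activity, control_activity,
                       inhibition_cutoff=80.0, activation_cutoff=110.0):
    """Express treated activity as a percentage of the untreated control.

    The control is assigned a relative activity value of 100%.
    The cutoffs are the illustrative 80% and 110% values from the text.
    """
    if control_activity <= 0:
        raise ValueError("control activity must be positive")
    relative = 100.0 * treated_activity / control_activity
    if relative <= inhibition_cutoff:
        label = "inhibitor"
    elif relative >= activation_cutoff:
        label = "activator"
    else:
        label = "no clear effect"
    return relative, label


# Hypothetical readings: the control well gives 100 units of activity.
print(classify_modulator(40.0, 100.0))
print(classify_modulator(150.0, 100.0))
```

Stricter cutoffs (for example 50% for inhibition or 200% for activation) can be passed as arguments where a more stringent definition is desired.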

DETAILED DESCRIPTION OF THE INVENTION 
I. INTRODUCTION

[71] The present application demonstrates that, surprisingly, modulated
levels of mRNA comprising sequences of the invention occur in human adipose tissue 
collected from either insulin resistant obese non-diabetics or from type 2 diabetic individuals 
compared to levels of the mRNA in the lean, non-diabetic individuals. Insulin resistant obese 
individuals are generally predisposed to become type II diabetics. Therefore, the modulation 
of the sequences in the study described herein indicates the sequences' involvement in
obesity, diabetes and/or pre-diabetes. 

[72] Without intending to limit the invention to a particular mechanism of
action, it is believed that modulation of the expression or activity of the polypeptides or
polynucleotides of the invention is beneficial in treating obesity, diabetic, pre-diabetic or 
insulin resistant, non-diabetic patients. Furthermore, modulated levels of the polypeptides of 
the invention are indicative of insulin resistance, obesity, diabetes or a predisposition for
obesity and/or diabetes. Thus, the detection of a polypeptide of the invention is useful for
diagnosis of obesity, predisposition for obesity and/or diabetes, diabetes and/or insulin 
resistance. 

[73] This invention also provides methods of using polypeptides of the
invention and modulators of the polypeptides of the invention to diagnose and treat obesity,
diabetes, pre-diabetes (including insulin resistant individuals) and related metabolic diseases. 
The present method also provides methods of identifying modulators of expression or activity 
of the polypeptides of the invention. Such modulators are useful for treating obesity and/or 
Type 2 diabetes as well as the pathological aspects of obesity (e.g., increased risk for 
cardiovascular disease, hypertension or cancer) and/or diabetes (e.g., insulin resistance).

II. GENERAL RECOMBINANT NUCLEIC ACID METHODS FOR USE 
WITH THE INVENTION 

[74] In numerous embodiments of the present invention, nucleic acids
encoding a polypeptide of the present invention will be isolated and cloned using
recombinant methods. Such embodiments are used, e.g., to isolate polynucleotides identical
or substantially identical to SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29,
31, 34, 36, 38, 41, 43, 45, 48, 50, 52, 54, 56, 58, 60, 62, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83,
85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125,
127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 162, 164,
166, 168, 170, 172, 174, 176, 178, 180, 183, 185, 187, 189, 191, 193, 195, 198, 200, 202, 204,
206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 231, 233, 235, 237, 239, 241, 244, 
246, 248, 250, 253, 255, 257, 259, 261, 264, 266, 268, 271 or 273 for protein expression or 
during the generation of variants, derivatives, expression cassettes, or other sequences 
derived from a polypeptide or polynucleotide of the invention, to monitor gene expression,
for the isolation or detection of sequences in different species, for diagnostic purposes in a
patient, e.g., to detect mutations in a polypeptide or polynucleotide of the invention or to 
detect expression levels of nucleic acids or polypeptides. In some embodiments, the 
sequences encoding the polypeptides of the invention (or polypeptides comprising fragments 
of the polypeptides of the invention) are operably linked to a heterologous promoter. In some
cases, fragments of the polypeptides of the invention are at least 20, 50, 75 or 100 amino
acids in length. The polypeptides of the invention can be linked to heterologous amino acid 
sequences using recombinant DNA technology. In one embodiment, the nucleic acids of the 
invention are from any mammal, including, in particular, e.g., a human, a mouse, a rat, etc.

[75] Polynucleotides, including expression cassettes, encoding polypeptides
of the invention can be introduced into cells and optionally expressed in the cells. 
Polynucleotides of the invention can be introduced into eukaryotic or prokaryotic cells, 
including adipocyte or muscle cells. The cells can be primary cells or cell lines. 

A. General Recombinant Nucleic Acid Methods

[76] This invention relies on routine techniques in the field of recombinant 
genetics. Basic texts disclosing the general methods of use in this invention include 
Sambrook et al., Molecular Cloning, A Laboratory Manual (3rd ed. 2001); Kriegler, Gene
Transfer and Expression: A Laboratory Manual (1990); and Current Protocols in Molecular
Biology (Ausubel et al., eds., 1994).

[77] For nucleic acids, sizes are given in either kilobases (kb) or base pairs 
(bp). These are estimates derived from agarose or acrylamide gel electrophoresis, from 
sequenced nucleic acids, or from published DNA sequences. For proteins, sizes are given in 
kilodaltons (kDa) or amino acid residue numbers. Protein sizes are estimated from gel
electrophoresis, from sequenced proteins, from derived amino acid sequences, or from
published protein sequences. 

[78] Oligonucleotides that are not commercially available can be 
chemically synthesized according to the solid phase phosphoramidite triester method first 
described by Beaucage & Caruthers, Tetrahedron Letts. 22:1859-1862 (1981), using an 
automated synthesizer, as described in Van Devanter et al., Nucleic Acids Res. 12:6159-6168
(1984). Purification of oligonucleotides is by either native acrylamide gel electrophoresis or
by anion-exchange HPLC as described in Pearson & Regnier, J. Chrom. 255:137-149 (1983).

[79] The sequence of the cloned genes and synthetic oligonucleotides can
be verified after cloning using, e.g., the chain termination method for sequencing
double-stranded templates of Wallace et al., Gene 16:21-26 (1981).

B. Cloning Methods for the Isolation of Nucleotide Sequences Encoding
Desired Proteins 

[80] In general, the nucleic acids encoding the subject proteins are cloned 
from DNA sequence libraries that are made to encode cDNA or genomic DNA. The 
particular sequences can be located by hybridizing with an oligonucleotide probe, the
sequence of which can be derived from the sequences disclosed herein, which provide a
reference for PCR primers and define suitable regions for isolating probes specific for the
polypeptides or polynucleotides of the invention. Alternatively, where the sequence is cloned
into an expression library, the expressed recombinant protein can be detected
immunologically with antisera or purified antibodies made against a polypeptide of interest, 
including those disclosed herein. 

[81] Methods for making and screening genomic and cDNA libraries are
well known to those of skill in the art (see, e.g., Gubler and Hoffman, Gene 25:263-269
(1983); Benton and Davis Science, 196:180-182 (1977); and Sambrook, supra). 

[82] Briefly, to make the cDNA library, one should choose a source that is 
rich in mRNA. The mRNA can then be made into cDNA, ligated into a recombinant vector, 
and transfected into a recombinant host for propagation, screening and cloning. For a
genomic library, the DNA is extracted from a suitable tissue and either mechanically sheared
or enzymatically digested to yield fragments of preferably about 5-100 kb. The fragments are
then separated by gradient centrifugation from undesired sizes and are constructed in
bacteriophage lambda vectors. These vectors and phage are packaged in vitro, and the
recombinant phages are analyzed by plaque hybridization. Colony hybridization is carried
out as generally described in Grunstein et al., Proc. Natl. Acad. Sci. USA, 72:3961-3965
(1975).

[83] An alternative method combines the use of synthetic oligonucleotide 
primers with polymerase extension on an mRNA or DNA template. Suitable primers can be 
designed from specific sequences disclosed herein. This polymerase chain reaction (PCR)
method amplifies the nucleic acids encoding the protein of interest directly from mRNA,
cDNA, genomic libraries or cDNA libraries. Restriction endonuclease sites can be
incorporated into the primers. Polymerase chain reaction or other in vitro amplification
methods may also be useful, for example, to clone nucleic acids encoding specific proteins
and express said proteins, to synthesize nucleic acids that will be used as probes for detecting
the presence of mRNA encoding a polypeptide of the invention in physiological samples, for
nucleic acid sequencing, or for other purposes (see, U.S. Patent Nos. 4,683,195 and 
4,683,202). Genes amplified by a PCR reaction can be purified from agarose gels and cloned 
into an appropriate vector. 
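
As a non-limiting illustration of how primers carrying restriction endonuclease sites can be derived from a known sequence, the following Python sketch builds a forward and a reverse primer from the two ends of a template. The function names, the short template, the primer length, the two-base "GC" leader and the choice of EcoRI (GAATTC) and BamHI (GGATCC) sites are all assumptions made for the example; a practical design would also consider melting temperature, secondary structure and internal restriction sites.

```python
# Map each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(template, primer_len=20,
                   fwd_site="GAATTC", rev_site="GGATCC", leader="GC"):
    """Return (forward, reverse) primers, written 5' to 3'.

    Each primer is: leader + restriction site + template-specific region.
    The forward primer copies the 5' end of the sense strand; the reverse
    primer is the reverse complement of the 3' end of the sense strand.
    """
    template = template.upper()
    if len(template) < primer_len:
        raise ValueError("template is shorter than the requested primer length")
    fwd_core = template[:primer_len]
    rev_core = reverse_complement(template[-primer_len:])
    return leader + fwd_site + fwd_core, leader + rev_site + rev_core


# Hypothetical 18-nucleotide template, used only to keep the example readable.
forward, reverse = design_primers("ATGAAACCCGGGTTTTAG", primer_len=6)
print(forward)
print(reverse)
```

The amplified product then carries one site at each end, so that it can be digested and ligated directionally into a vector cut with the same two enzymes.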

[84] Appropriate primers and probes for identifying the genes encoding a
polypeptide of the invention from mammalian tissues can be derived from the sequences
provided herein. For a general overview of PCR, see, Innis et al., PCR Protocols: A Guide to
Methods and Applications, Academic Press, San Diego (1990).

[85] Synthetic oligonucleotides can be used to construct genes. This is
done using a series of overlapping oligonucleotides, usually 40-120 bp in length, representing
both the sense and anti-sense strands of the gene. These DNA fragments are then annealed,
ligated and cloned.
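
One way to picture the overlapping oligonucleotides described above is sketched below in Python. The sketch assumes a simple tiling scheme in which sense-strand oligonucleotides abut one another and anti-sense oligonucleotides are offset by half an oligonucleotide length, so that each anti-sense oligonucleotide bridges the junction between two sense oligonucleotides. The function names, the 20-nucleotide sequence and the 8-nucleotide oligonucleotide length are hypothetical and chosen only for readability; they are not sequences of the invention.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def tile_oligos(gene, oligo_len=40):
    """Split a gene into sense and anti-sense oligonucleotides (5' to 3').

    Sense oligos cover the top strand end to end. Anti-sense oligos cover
    the bottom strand starting half an oligo length in, so every junction
    between two sense oligos is spanned by one anti-sense oligo.
    """
    gene = gene.upper()
    half = oligo_len // 2
    sense = [gene[i:i + oligo_len] for i in range(0, len(gene), oligo_len)]
    antisense = [reverse_complement(gene[i:i + oligo_len])
                 for i in range(half, len(gene), oligo_len)]
    return sense, antisense


# Hypothetical sequence; real oligonucleotides would usually be 40-120 nt.
sense, antisense = tile_oligos("ATGGCCAAATTTGGGCCCTA", oligo_len=8)
print(sense)
print(antisense)
```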

[86] A polynucleotide encoding a polypeptide of the invention can be 
cloned using intermediate vectors before transformation into mammalian cells for expression. 
These intermediate vectors are typically prokaryote vectors or shuttle vectors. The proteins 
can be expressed in either prokaryotes or eukaryotes, using standard methods well known to 
those of skill in the art. 

III. PURIFICATION OF PROTEINS OF THE INVENTION 

[87] Either naturally occurring or recombinant polypeptides of the invention 
can be purified for use in functional assays. Naturally occurring polypeptides of the 
invention can be purified from any source (e.g., tissues of an organism expressing an 
ortholog). Recombinant polypeptides can be purified from any suitable expression system. 

[88] The polypeptides of the invention may be purified to substantial purity 
by standard techniques, including selective precipitation with such substances as ammonium 
sulfate; column chromatography, immunopurification methods, and others (see, e.g., Scopes, 
Protein Purification: Principles and Practice (1982); U.S. Patent No. 4,673,641; Ausubel et
al., supra; and Sambrook et al., supra).

[89] A number of procedures can be employed when recombinant 
polypeptides are being purified. For example, proteins having established molecular 
adhesion properties can be reversibly fused to a polypeptide of the invention. With the
appropriate ligand, either protein can be selectively adsorbed to a purification column and 
then freed from the column in a relatively pure form. The fused protein may be then removed 
by enzymatic activity. Finally polypeptides can be purified using immunoaffinity columns. 

A. Purification of Proteins from Recombinant Bacteria 

[90] When recombinant proteins are expressed by the transformed bacteria
in large amounts, typically after promoter induction, although expression can be constitutive,
the proteins may form insoluble aggregates. There are several protocols that are suitable for
purification of protein inclusion bodies. For example, purification of aggregate proteins
(hereinafter referred to as inclusion bodies) typically involves the extraction, separation
and/or purification of inclusion bodies by disruption of bacterial cells typically, but not
limited to, by incubation in a buffer of about 100-150 µg/ml lysozyme and 0.1% Nonidet P40,
a non-ionic detergent. The cell suspension can be ground using a Polytron grinder (Brinkman
Instruments, Westbury, NY). Alternatively, the cells can be sonicated on ice. Alternate
methods of lysing bacteria are described in Ausubel et al. and Sambrook et al., both supra,
and will be apparent to those of skill in the art. 

[91] The cell suspension is generally centrifuged and the pellet containing
the inclusion bodies resuspended in buffer which does not dissolve but washes the inclusion
bodies, e.g., 20 mM Tris-HCl (pH 7.2), 1 mM EDTA, 150 mM NaCl and 2% Triton-X 100, a
non-ionic detergent. It may be necessary to repeat the wash step to remove as much cellular
debris as possible. The remaining pellet of inclusion bodies may be resuspended in an
appropriate buffer (e.g., 20 mM sodium phosphate, pH 6.8, 150 mM NaCl). Other
appropriate buffers will be apparent to those of skill in the art.

[92] Following the washing step, the inclusion bodies are solubilized by the
addition of a solvent that is both a strong hydrogen acceptor and a strong hydrogen donor (or 
a combination of solvents each having one of these properties). The proteins that formed the 
inclusion bodies may then be renatured by dilution or dialysis with a compatible buffer. 
Suitable solvents include, but are not limited to, urea (from about 4 M to about 8 M), 
formamide (at least about 80%, volume/volume basis), and guanidine hydrochloride (from 
about 4 M to about 8 M). Some solvents that are capable of solubilizing aggregate-forming 
proteins, such as SDS (sodium dodecyl sulfate) and 70% formic acid, are inappropriate for 
use in this procedure due to the possibility of irreversible denaturation of the proteins, 
accompanied by a lack of immunogenicity and/or activity. Although guanidine 
hydrochloride and similar agents are denaturants, this denaturation is not irreversible and 
renaturation may occur upon removal (by dialysis, for example) or dilution of the denaturant, 
allowing re-formation of the immunologically and/or biologically active protein of interest. 
After solubilization, the protein can be separated from other bacterial proteins by standard 
separation techniques. 

[93] Alternatively, it is possible to purify proteins from the bacterial periplasm.
Where the protein is exported into the periplasm of the bacteria, the periplasmic fraction of
the bacteria can be isolated by cold osmotic shock in addition to other methods known to
those of skill in the art (see, Ausubel et al., supra). To isolate recombinant proteins from the
periplasm, the bacterial cells are centrifuged to form a pellet. The pellet is resuspended in a
buffer containing 20% sucrose. To lyse the cells, the bacteria are centrifuged and the pellet is
resuspended in ice-cold 5 mM MgSO4 and kept in an ice bath for approximately 10 minutes.
The cell suspension is centrifuged and the supernatant decanted and saved. The recombinant
proteins present in the supernatant can be separated from the host proteins by standard
separation techniques well known to those of skill in the art. 

B. Purification of Proteins from Insect Cells 

[94] Proteins can also be purified from eukaryotic gene expression systems
as described in, e.g., Fernandez and Hoeffler, Gene Expression Systems (1999). In some
embodiments, baculovirus expression systems are used to isolate proteins of the invention.
Recombinant baculoviruses are generally generated by replacing the polyhedrin coding
sequence of a baculovirus with a gene to be expressed (e.g., encoding a polypeptide of the
invention). Viruses lacking the polyhedrin gene have a unique plaque morphology making
them easy to recognize. In some embodiments, a recombinant baculovirus is generated by 
first cloning a polynucleotide of interest into a transfer vector (e.g., a pUC based vector) such 
that the polynucleotide is operably linked to a polyhedrin promoter. The transfer vector is 
transfected with wildtype DNA into an insect cell (e.g., Sf9, Sf21 or BTI-TN-5B1-4 cells),
resulting in homologous recombination and replacement of the polyhedrin gene in the
wildtype viral DNA with the polynucleotide of interest. Virus can then be generated and 
plaque purified. Protein expression results upon viral infection of insect cells. Expressed 
proteins can be harvested from cell supernatant if secreted, or from cell lysates if intracellular. 
See, e.g., Ausubel et al. and Fernandez and Hoeffler, supra.

C. Purification of secreted proteins from mammalian cells 

[95] Polypeptides of the invention, and in particular, secreted proteins of the 
invention can be readily purified from mammalian cells expressing the polypeptides. 
Expression of the polypeptides can be the result of either transient or stable expression of the 
protein from a recombinant expression cassette introduced into the cells. Secreted proteins
can generally be isolated using standard procedures to purify the proteins from the cell 
culture medium. 

D. Standard Protein Separation Techniques For Purifying Proteins 

1. Solubility Fractionation

[96] Often as an initial step, and if the protein mixture is complex, an initial
salt fractionation can separate many of the unwanted host cell proteins (or proteins derived
from the cell culture media) from the recombinant protein of interest. The preferred salt is
ammonium sulfate. Ammonium sulfate precipitates proteins by effectively reducing the
amount of water in the protein mixture. Proteins then precipitate on the basis of their
solubility. The more hydrophobic a protein is, the more likely it is to precipitate at lower
ammonium sulfate concentrations. A typical protocol is to add saturated ammonium sulfate
to a protein solution so that the resultant ammonium sulfate concentration is between 20-30%.
This will precipitate the most hydrophobic proteins. The precipitate is discarded (unless the
protein of interest is hydrophobic) and ammonium sulfate is added to the supernatant to a
concentration known to precipitate the protein of interest. The precipitate is then solubilized
in buffer and the excess salt removed if necessary, through either dialysis or diafiltration.
Other methods that rely on solubility of proteins, such as cold ethanol precipitation, are well
known to those of skill in the art and can be used to fractionate complex protein mixtures. 
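
For illustration, the volume of saturated ammonium sulfate solution needed to reach a given concentration can be estimated with a simple mixing calculation. The Python sketch below assumes that the percentages refer to percent saturation, that volumes are additive on mixing, and that the added solution is 100% saturated; the function name and the 100 ml starting volume are hypothetical. Under these assumptions the added volume V satisfies (V0 x S1 + V x 100) / (V0 + V) = S2, which rearranges to V = V0 x (S2 - S1) / (100 - S2).

```python
def saturated_volume_to_add(volume_ml, target_pct, start_pct=0.0):
    """Volume (ml) of saturated ammonium sulfate to add to a protein solution.

    volume_ml  -- current volume of the protein solution
    target_pct -- desired percent saturation after mixing
    start_pct  -- percent saturation already present (0 for a fresh sample)
    Assumes additive volumes and a 100% saturated stock solution.
    """
    if not 0.0 <= start_pct < target_pct < 100.0:
        raise ValueError("require 0 <= start_pct < target_pct < 100")
    return volume_ml * (target_pct - start_pct) / (100.0 - target_pct)


# First cut: bring a hypothetical 100 ml sample to 20% saturation.
first_cut = saturated_volume_to_add(100.0, 20.0)
print(first_cut)  # 25.0

# Second cut: bring the resulting 125 ml supernatant from 20% to 50%.
print(saturated_volume_to_add(100.0 + first_cut, 50.0, start_pct=20.0))
```

Published nomograms for solid ammonium sulfate account for the volume change on dissolution and should be preferred where the salt is added as a solid.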

2. Size Differential Filtration 

[97] Based on a calculated molecular weight, a protein of greater and lesser 
size can be isolated using ultrafiltration through membranes of different pore sizes (for
example, Amicon or Millipore membranes). As a first step, the protein mixture is
ultrafiltered through a membrane with a pore size that has a lower molecular weight cut-off
than the molecular weight of the protein of interest. The retentate of the ultrafiltration is then
ultrafiltered against a membrane with a molecular cut off greater than the molecular weight of
the protein of interest. The recombinant protein will pass through the membrane into the
filtrate. The filtrate can then be chromatographed as described below.
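
The two-step logic above can be summarized in a few lines of Python. The sketch treats each membrane as an ideal, sharp molecular weight cut-off, which real membranes are not, and the function name, protein names and sizes are hypothetical values chosen for the example.

```python
def two_step_ultrafiltration(proteins_kda, target_kda, low_cutoff_kda, high_cutoff_kda):
    """Return the proteins expected in the final filtrate.

    proteins_kda -- dict mapping protein name to molecular weight in kDa
    Step 1 keeps the retentate of the low cut-off membrane (larger species).
    Step 2 keeps the filtrate of the high cut-off membrane (smaller species).
    Membranes are idealized as perfectly sharp cut-offs.
    """
    if not low_cutoff_kda < target_kda < high_cutoff_kda:
        raise ValueError("cut-offs must bracket the protein of interest")
    retentate = {name: mw for name, mw in proteins_kda.items() if mw > low_cutoff_kda}
    filtrate = {name: mw for name, mw in retentate.items() if mw < high_cutoff_kda}
    return filtrate


# Hypothetical mixture with a 45 kDa protein of interest.
mixture = {"small contaminant": 8.0, "protein of interest": 45.0, "large contaminant": 150.0}
print(two_step_ultrafiltration(mixture, 45.0, low_cutoff_kda=30.0, high_cutoff_kda=100.0))
```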

3. Column Chromatography 

[98] The proteins of interest can also be separated from other proteins on 
the basis of their size, net surface charge, hydrophobicity and affinity for ligands. In addition, 
antibodies raised against proteins can be conjugated to column matrices and the proteins 
immunopurified. All of these methods are well known in the art.

[99] Immunoaffinity chromatography using antibodies raised to a variety of 
affinity tags such as hemagglutinin (HA), FLAG, Xpress, Myc, hexahistidine (His), 
glutathione S transferase (GST) and the like can be used to purify polypeptides. The His tag 
will also act as a chelating agent for certain metals (e.g., Ni) and thus the metals can also be 
used to purify His-containing polypeptides. After purification, the tag is optionally removed
by specific proteolytic cleavage.

[100] It will be apparent to one of skill that chromatographic techniques can
be performed at any scale and using equipment from many different manufacturers (e.g.,
Pharmacia Biotech).

IV. DETECTION OF POLYNUCLEOTIDES OF THE INVENTION

[101] Those of skill in the art will recognize that detection of expression of
polynucleotides and polypeptides of the invention has many uses. For example, as discussed
herein, detection of levels of polynucleotides and polypeptides of the invention in a patient is
useful for diagnosing diabetes or a predisposition for at least some of the pathological effects
of diabetes. Moreover, detection of gene expression is useful to identify modulators of 
expression of polynucleotides and polypeptides of the invention. 

[102] A variety of methods of specific DNA and RNA measurement that use
nucleic acid hybridization techniques are known to those of skill in the art (see, Sambrook,
supra). Some methods involve an electrophoretic separation (e.g., Southern blot for detecting
DNA, and Northern blot for detecting RNA), but measurement of DNA and RNA can also be
carried out in the absence of electrophoretic separation (e.g., by dot blot). Southern blot of
genomic DNA (e.g., from a human) can be used for screening for restriction fragment length
polymorphism (RFLP) to detect the presence of a genetic disorder affecting a polypeptide of
the invention.

[103] The selection of a nucleic acid hybridization format is not critical. A
variety of nucleic acid hybridization formats are known to those skilled in the art. For
example, common formats include sandwich assays and competition or displacement assays.
Hybridization techniques are generally described in Hames and Higgins, Nucleic Acid
Hybridization, A Practical Approach, IRL Press (1985); Gall and Pardue, Proc. Natl. Acad.
Sci. U.S.A., 63:378-383 (1969); and John et al., Nature, 223:582-587 (1969).

[104] Detection of a hybridization complex may require the binding of a
signal-generating complex to a duplex of target and probe polynucleotides or nucleic acids.
Typically, such binding occurs through ligand and anti-ligand interactions as between a
ligand-conjugated probe and an anti-ligand conjugated with a signal. The binding of the
signal generation complex is also readily amenable to accelerations by exposure to ultrasonic
energy.

[105] The label may also allow indirect detection of the hybridization
complex. For example, where the label is a hapten or antigen, the sample can be detected by
using antibodies. In these systems, a signal is generated by attaching fluorescent or enzyme
molecules to the antibodies or in some cases, by attachment to a radioactive label (see, e.g.,
Tijssen, "Practice and Theory of Enzyme Immunoassays," Laboratory Techniques in
Biochemistry and Molecular Biology, Burdon and van Knippenberg Eds., Elsevier (1985), pp.
9-20). 

[106] The probes are typically labeled either directly, as with isotopes, 
chromophores, lumiphores, chromogens, or indirectly, such as with biotin, to which a 
streptavidin complex may later bind. Thus, the detectable labels used in the assays of the
present invention can be primary labels (where the label comprises an element that is detected 
directly or that produces a directly detectable element) or secondary labels (where the 
detected label binds to a primary label, e.g., as is common in immunological labeling). 
Typically, labeled signal nucleic acids are used to detect hybridization. Complementary
nucleic acids or signal nucleic acids may be labeled by any one of several methods typically
used to detect the presence of hybridized polynucleotides. The most common method of
detection is the use of autoradiography with ³H, ¹²⁵I, ³⁵S, ¹⁴C, or ³²P-labeled probes or the like.

[107] Other labels include, e.g., ligands that bind to labeled antibodies,
fluorophores, chemiluminescent agents, enzymes, and antibodies that can serve as specific
binding pair members for a labeled ligand. An introduction to labels, labeling procedures and
detection of labels is found in Polak and Van Noorden Introduction to Immunocytochemistry, 
2nd ed., Springer Verlag, NY (1997); and in Haugland Handbook of Fluorescent Probes and 
Research Chemicals, a combined handbook and catalogue Published by Molecular Probes, 
Inc. (1996). 

[108] In general, a detector that monitors a particular probe or probe
combination is used to detect the detection reagent label. Typical detectors include
spectrophotometers, phototubes and photodiodes, microscopes, scintillation counters,
cameras, film and the like, as well as combinations thereof. Examples of suitable detectors
are widely available from a variety of commercial sources known to persons of skill in the art.
Commonly, an optical image of a substrate comprising bound labeling moieties is digitized
for subsequent computer analysis. 

[109] The amount of, for example, an RNA is measured by quantifying the 
amount of label fixed to the solid support by binding of the detection reagent. Typically, the 
presence of a modulator during incubation will increase or decrease the amount of label fixed
to the solid support relative to a control incubation that does not comprise the modulator, or
as compared to a baseline established for a particular reaction type. Means of detecting and 
quantifying labels are well known to those of skill in the art. 

[110] In some embodiments, the target nucleic acid or the probe is 
immobilized on a solid support. Solid supports suitable for use in the assays of the invention
are known to those of skill in the art. As used herein, a solid support is a matrix of material
in a substantially fixed arrangement. 

[111] A variety of automated solid-phase assay techniques are also
appropriate. For instance, very large scale immobilized polymer arrays (VLSIPS™), i.e.,
Gene Chips or microarrays, available from Affymetrix, Inc. in Santa Clara, CA can be used
to detect changes in expression levels of a plurality of genes involved in the same regulatory
pathways simultaneously. See, Tijssen, supra; Fodor et al. (1991) Science 251:767-777;
Sheldon et al. (1993) Clinical Chemistry 39(4):718-719; and Kozal et al. (1996) Nature
Medicine 2(7):753-759. Similarly, spotted cDNA arrays (arrays of cDNA sequences bound
to nylon, glass or another solid support) can also be used to monitor expression of a plurality
of genes. 

[112] Typically, the array elements are organized in an ordered fashion so 
that each element is present at a specified location on the substrate. Because the array 
elements are at specified locations on the substrate, the hybridization patterns and intensities
(which together create a unique expression profile) can be interpreted in terms of expression
levels of particular genes and can be correlated with a particular disease or condition or
treatment. See, e.g., Schena et al., Science 270:467-470 (1995) and Lockhart et al., Nature
Biotech. 14:1675-1680 (1996).

[113] Hybridization specificity can be evaluated by comparing the
hybridization of specificity-control polynucleotide sequences to specificity-control
polynucleotide probes that are added to a sample in a known amount. The specificity-control
target polynucleotides may have one or more sequence mismatches compared with the
corresponding polynucleotide sequences. In this manner, whether only complementary target
polynucleotides are hybridizing to the polynucleotide sequences or whether mismatched
hybrid duplexes are forming is determined.

[114] Hybridization reactions can be performed in absolute or differential
hybridization formats. In the absolute hybridization format, polynucleotide probes from one
sample are hybridized to the sequences in a microarray format and signals detected after
hybridization complex formation correlate to polynucleotide probe levels in a sample. In the
differential hybridization format, the differential expression of a set of genes in two
biological samples is analyzed. For differential hybridization, polynucleotide probes from
both biological samples are prepared and labeled with different labeling moieties. A mixture
of the two labeled polynucleotide probes is added to a microarray. The microarray is then
examined under conditions in which the emissions from the two different labels are
individually detectable. Sequences in the microarray that are hybridized to substantially equal
numbers of polynucleotide probes derived from both biological samples give a distinct
combined fluorescence (Shalon et al. PCT publication WO95/35505). In some embodiments,
the labels are fluorescent labels with distinguishable emission spectra, such as Cy3 and Cy5
fluorophores.

[115] After hybridization, the microarray is washed to remove nonhybridized
nucleic acids and complex formation between the hybridizable array elements and the
polynucleotide probes is detected. Methods for detecting complex formation are well known
to those skilled in the art. In some embodiments, the polynucleotide probes are labeled with a
fluorescent label and measurement of levels and patterns of fluorescence indicative of
complex formation is accomplished by fluorescence microscopy, such as confocal 
fluorescence microscopy. 

[116] In a differential hybridization experiment, polynucleotide probes from 
two or more different biological samples are labeled with two or more different fluorescent
labels with different emission wavelengths. Fluorescent signals are detected separately with
different photomultipliers set to detect specific wavelengths. The relative 
abundances/expression levels of the polynucleotide probes in two or more samples are 
obtained. 

[117] Typically, microarray fluorescence intensities can be normalized to
take into account variations in hybridization intensities when more than one microarray is
used under similar test conditions. In some embodiments, individual polynucleotide
probe/target complex hybridization intensities are normalized using the intensities derived
from internal normalization controls contained on each microarray.
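
As one hedged illustration of such normalization, the Python sketch below divides every intensity on a microarray by the mean intensity of that array's internal normalization controls and then compares two normalized arrays as log2 ratios. The function names, spot identifiers and intensity values are hypothetical, and scaling to the control mean is only one of several normalization schemes that could be used.

```python
import math


def normalize_array(intensities, control_ids):
    """Scale one array's intensities by the mean of its internal controls.

    intensities -- dict mapping spot identifier to raw fluorescence intensity
    control_ids -- identifiers of the internal normalization control spots
    Returns normalized intensities for the non-control spots only.
    """
    control_mean = sum(intensities[c] for c in control_ids) / len(control_ids)
    if control_mean <= 0:
        raise ValueError("control intensities must be positive")
    return {spot: value / control_mean
            for spot, value in intensities.items() if spot not in control_ids}


def log2_ratios(sample, reference):
    """log2(sample / reference) for every spot present on both arrays."""
    return {spot: math.log2(sample[spot] / reference[spot])
            for spot in sample if spot in reference}


# Hypothetical raw intensities from two arrays hybridized under similar conditions.
controls = ["ctrl1", "ctrl2"]
lean = {"ctrl1": 1000.0, "ctrl2": 3000.0, "geneA": 500.0, "geneB": 800.0}
obese = {"ctrl1": 2000.0, "ctrl2": 6000.0, "geneA": 4000.0, "geneB": 1600.0}

ratios = log2_ratios(normalize_array(obese, controls), normalize_array(lean, controls))
print(ratios)
```

In this made-up example the second array is twice as bright overall, so the raw twofold difference for geneB disappears after normalization, while geneA remains elevated.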

[118] Detection of nucleic acids can also be accomplished, for example, by 
using a labeled detection moiety that binds specifically to duplex nucleic acids (e.g., an
antibody that is specific for RNA-DNA duplexes). One example uses an antibody that
recognizes DNA-RNA heteroduplexes in which the antibody is linked to an enzyme
(typically by recombinant or covalent chemical bonding). The antibody is detected when the
enzyme reacts with its substrate, producing a detectable product. Coutlee et al. (1989)
Analytical Biochemistry 181:153-162; Bogulavski et al. (1986) J. Immunol. Methods
89:123-130; Prooijen-Knegt (1982) Exp. Cell Res. 141:397-407; Rudkin (1976) Nature 265:472-473;
Stollar (1970) PNAS 65:993-1000; Ballard (1982) Mol. Immunol. 19:793-799; Pisetsky and
Caster (1982) Mol. Immunol. 19:645-650; Viscidi et al. (1988) J. Clin. Microbiol.
41:199-209; and Kiney et al. (1989) J. Clin. Microbiol. 27:6-12 describe antibodies to RNA duplexes,
including homo and heteroduplexes. Kits comprising antibodies specific for DNA:RNA
hybrids are available, e.g., from Digene Diagnostics, Inc. (Beltsville, MD).

[119] In addition to available antibodies, one of skill in the art can easily 
make antibodies specific for nucleic acid duplexes using existing techniques, or modify those 
antibodies that are commercially or publicly available. In addition to the art referenced above, 
general methods for producing polyclonal and monoclonal antibodies are known to those of 
skill in the art (see, e.g., Paul (ed) Fundamental Immunology, Third Edition Raven Press, Ltd.,
NY (1993); Coligan Current Protocols in Immunology Wiley/Greene, NY (1991); Harlow 
and Lane Antibodies: A Laboratory Manual Cold Spring Harbor Press, NY (1989); Stites et 
al (eds.) Basic and Clinical Immunology (4th ed.) Lange Medical Publications, Los Altos, 
CA, and references cited therein; Goding Monoclonal Antibodies: Principles and Practice 
(2d ed.), Academic Press, New York, NY, (1986); and Kohler and Milstein Nature 256: 495- 
497 (1975)). Other suitable techniques for antibody preparation include selection of libraries 
of recombinant antibodies in phage or similar vectors (see, Huse et al., Science 246:1275-
1281 (1989); and Ward et al., Nature 341:544-546 (1989)). Specific monoclonal and
polyclonal antibodies and antisera will usually bind with a K_D of at least about 0.1 µM,
preferably at least about 0.01 µM or better, and most typically and preferably, 0.001 µM or
better. 

[120] The nucleic acids used in this invention can be either positive or 
negative probes. Positive probes bind to their targets and the presence of duplex formation is 
evidence of the presence of the target. Negative probes fail to bind to the suspect target and 
the absence of duplex formation is evidence of the presence of the target. For example, the 
use of a wild type specific nucleic acid probe or PCR primers may serve as a negative probe 
in an assay sample where only the nucleotide sequence of interest is present. 

[121] The sensitivity of the hybridization assays may be enhanced through 
use of a nucleic acid amplification system that multiplies the target nucleic acid being 
detected. Examples of such systems include the polymerase chain reaction (PCR) system and 
the ligase chain reaction (LCR) system. Other methods recently described in the art are the 
nucleic acid sequence based amplification (NASBA, Cangene, Mississauga, Ontario) and Q 
Beta Replicase systems. These systems can be used to directly identify mutants where the 
PCR or LCR primers are designed to be extended or ligated only when a selected sequence is
present. Alternatively, the selected sequences can be generally amplified using, for example,
nonspecific PCR primers and the amplified target region later probed for a specific sequence
indicative of a mutation. It is understood that various detection probes, including Taqman
and molecular beacon probes, can be used to monitor amplification reaction products, e.g., in
real time. 

[122] An alternative means for determining the level of expression of the 
nucleic acids of the present invention is in situ hybridization. In situ hybridization assays are 
well known and are generally described in Angerer et al., Methods Enzymol. 152:649-660
(1987). In an in situ hybridization assay, cells, preferentially human cells from the 
cerebellum or the hippocampus, are fixed to a solid support, typically a glass slide. If DNA is 
to be probed, the cells are denatured with heat or alkali. The cells are then contacted with a 
hybridization solution at a moderate temperature to permit annealing of specific probes that
are labeled. The probes are preferably labeled with radioisotopes or fluorescent reporters.

[123] Single nucleotide polymorphism (SNP) analysis is also useful for
detecting differences between alleles of the polynucleotides (e.g., genes) of the invention.
SNPs linked to genes encoding polypeptides of the invention are useful, for instance, for
diagnosis of diseases (e.g., diabetes) whose occurrence is linked to the gene sequences of the
invention. For example, if an individual carries at least one SNP linked to a
disease-associated allele of the gene sequences of the invention, the individual is likely predisposed
for one or more of those diseases. If the individual is homozygous for a disease-linked SNP,
the individual is particularly predisposed for occurrence of that disease (e.g., diabetes). In
some embodiments, the SNP associated with the gene sequences of the invention is located
within 300,000; 200,000; 100,000; 75,000; 50,000; or 10,000 base pairs from the gene
sequence.
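
The distance windows listed above can be illustrated with a short sketch. It assumes that the SNP and the gene are described by positions on the same chromosome in a shared coordinate system, and that a SNP lying inside the gene has distance zero. The names `snp_distance_bp` and `smallest_window`, and the example coordinates, are hypothetical.

```python
# Distance windows (in base pairs) named in the text, smallest first.
WINDOWS_BP = (10_000, 50_000, 75_000, 100_000, 200_000, 300_000)


def snp_distance_bp(snp_pos, gene_start, gene_end):
    """Distance from a SNP to the nearest gene boundary; 0 if inside the gene."""
    if gene_start > gene_end:
        raise ValueError("gene_start must not exceed gene_end")
    if snp_pos < gene_start:
        return gene_start - snp_pos
    if snp_pos > gene_end:
        return snp_pos - gene_end
    return 0


def smallest_window(snp_pos, gene_start, gene_end):
    """Return the smallest listed window that contains the SNP, or None."""
    distance = snp_distance_bp(snp_pos, gene_start, gene_end)
    for window in WINDOWS_BP:
        if distance <= window:
            return window
    return None


# Hypothetical gene spanning positions 1,000,000 to 1,050,000.
near = smallest_window(1_060_000, 1_000_000, 1_050_000)
far = smallest_window(500_000, 1_000_000, 1_050_000)
```

Here `near` falls in the 10,000 base pair window, while `far` lies 500,000 base pairs upstream and so matches none of the listed windows.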

[124] Various real-time PCR methods including, e.g., Taqman or molecular
beacon-based assays (e.g., U.S. Patent Nos. 5,210,015; 5,487,972; Tyagi et al., Nature
Biotechnology 14:303 (1996); and PCT WO 95/13399) are useful to monitor for the presence
or absence of a SNP. Additional SNP detection methods include, e.g., DNA sequencing,
sequencing by hybridization, dot blotting, oligonucleotide array (DNA Chip) hybridization
analysis, or are described in, e.g., U.S. Patent No. 6,177,249; Landegren et al., Genome
Research, 8:769-776 (1998); Botstein et al., Am J Human Genetics 32:314-331 (1980);
Meyers et al., Methods in Enzymology 155:501-527 (1987); Keen et al., Trends in Genetics
7:5 (1991); Myers et al., Science 230:1242-1246 (1985); and Kwok et al., Genomics 23:138-
144 (1994).

V. DETECTION OF POLYPEPTIDES OF THE INVENTION

[125] In addition to the detection of polynucleotides of the invention and
gene expression using nucleic acid hybridization technology, one can also use immunoassays 
to detect polypeptides of the invention. Immunoassays can be used to qualitatively or 
quantitatively analyze polypeptides of the invention. A general overview of the applicable 
technology can be found in Harlow & Lane, Antibodies: A Laboratory Manual (1988). 

A. Antibodies to Target Proteins or other immunogens 

[126] Methods for producing polyclonal and monoclonal antibodies that 
react specifically with a protein of interest or other immunogen are known to those of skill in 
the art (see, e.g., Coligan, supra; and Harlow and Lane, supra; Stites et al., supra and
references cited therein; Goding, supra; and Kohler and Milstein, Nature 256:495-497
(1975)). Such techniques include antibody preparation by selection of antibodies from
libraries of recombinant antibodies in phage or similar vectors (see, Huse et al., supra; and
Ward et al., supra). For example, in order to produce antisera for use in an immunoassay, the
protein of interest or an antigenic fragment thereof, is isolated as described herein. For 
example, a recombinant protein is produced in a transformed cell line. An inbred strain of 
mice or rabbits is immunized with the protein using a standard adjuvant, such as Freund's 
adjuvant, and a standard immunization protocol. Alternatively, a synthetic peptide derived 
from the sequences disclosed herein is conjugated to a carrier protein and used as an
immunogen. 

[127] Polyclonal sera are collected and titered against the immunogen in an
immunoassay, for example, a solid phase immunoassay with the immunogen immobilized on
a solid support. Polyclonal antisera with a titer of 10^4 or greater are selected and tested for
their cross-reactivity against proteins other than the polypeptides of the invention or even
other homologous proteins from other organisms, using a competitive binding immunoassay.
Specific monoclonal and polyclonal antibodies and antisera will usually bind with a K_D of at
least about 0.1 mM, more usually at least about 1 µM, preferably at least about 0.1 µM or
better, and most preferably, 0.01 µM or better.

[128] A number of proteins of the invention comprising immunogens may be 
used to produce antibodies specifically or selectively reactive with the proteins of interest. 
Recombinant protein is the preferred immunogen for the production of monoclonal or 
polyclonal antibodies. Naturally occurring protein may also be used either in pure or impure 
form. Synthetic peptides made using the protein sequences described herein may also be
used as an immunogen for the production of antibodies to the protein. Recombinant protein
can be expressed in eukaryotic or prokaryotic cells and purified as generally described supra.
The product is then injected into an animal capable of producing antibodies. Either
monoclonal or polyclonal antibodies may be generated for subsequent use in immunoassays
to measure the protein.

[129] Methods of production of polyclonal antibodies are known to those of
skill in the art. In brief, an immunogen, preferably a purified protein, is mixed with an
adjuvant and animals are immunized. The animal's immune response to the immunogen
preparation is monitored by taking test bleeds and determining the titer of reactivity to
polypeptides of the invention. When appropriately high titers of antibody to the immunogen
are obtained, blood is collected from the animal and antisera are prepared. Further 
fractionation of the antisera to enrich for antibodies reactive to the protein can be done if 
desired (see, Harlow and Lane, supra). 

[130] Monoclonal antibodies may be obtained using various techniques
familiar to those of skill in the art. Typically, spleen cells from an animal immunized with a
desired antigen are immortalized, commonly by fusion with a myeloma cell (see, Kohler and
Milstein, Eur. J. Immunol. 6:511-519 (1976)). Alternative methods of immortalization
include, e.g., transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other
methods well known in the art. Colonies arising from single immortalized cells are screened
for production of antibodies of the desired specificity and affinity for the antigen, and yield of
the monoclonal antibodies produced by such cells may be enhanced by various techniques,
including injection into the peritoneal cavity of a vertebrate host. Alternatively, one may
isolate DNA sequences that encode a monoclonal antibody or a binding fragment thereof by
screening a DNA library from human B cells according to the general protocol outlined by
Huse et al., supra.

[131] Once target immunogen-specific antibodies are available, the
immunogen can be measured by a variety of immunoassay methods with qualitative and
quantitative results available to the clinician. For a review of immunological and
immunoassay procedures in general see, Stites, supra. Moreover, the immunoassays of the
present invention can be performed in any of several configurations, which are reviewed
extensively in Maggio Enzyme Immunoassay, CRC Press, Boca Raton, Florida (1980);
Tijssen, supra; and Harlow and Lane, supra.

[132] Immunoassays to measure target proteins in a human sample may use
a polyclonal antiserum that was raised to full-length polypeptides of the invention or a 
fragment thereof. This antiserum is selected to have low cross-reactivity against other 
proteins and any such cross-reactivity is removed by immunoabsorption prior to use in the 
immunoassay. 

B. Immunological Binding Assays 

[133] In some embodiments, a protein of interest is detected and/or 
quantified using any of a number of well-known immunological binding assays (see, e.g., U.S. 
Patents 4,366,241; 4,376,110; 4,517,288; and 4,837,168). For a review of the general
immunoassays, see also Asai Methods in Cell Biology Volume 37: Antibodies in Cell Biology, 
Academic Press, Inc. NY (1993); Stites, supra. Immunological binding assays (or 
immunoassays) typically utilize a "capture agent" to specifically bind to and often 
immobilize the analyte (e.g., full-length polypeptides of the present invention, or antigenic 
subsequences thereof). The capture agent is a moiety that specifically binds to the analyte. 
The antibody may be produced by any of a number of means well known to those of skill in 
the art and as described above. 

[134] Immunoassays also often utilize a labeling agent to bind specifically to
and label the binding complex formed by the capture agent and the analyte. The labeling 
agent may itself be one of the moieties comprising the antibody/analyte complex. 
Alternatively, the labeling agent may be a third moiety, such as another antibody, that 
specifically binds to the antibody/protein complex. 

[135] In a preferred embodiment, the labeling agent is a second antibody
bearing a label. Alternatively, the second antibody may lack a label, but it may, in turn, be 
bound by a labeled third antibody specific to antibodies of the species from which the second 
antibody is derived. The second antibody can be modified with a detectable moiety, such as 
biotin, to which a third labeled molecule can specifically bind, such as enzyme-labeled 
streptavidin. 

[136] Other proteins capable of specifically binding immunoglobulin 
constant regions, such as protein A or protein G, can also be used as the label agents. These 
proteins are normal constituents of the cell walls of streptococcal bacteria. They exhibit a 
strong non-immunogenic reactivity with immunoglobulin constant regions from a variety of
species (see, generally, Kronval et al., J. Immunol., 111:1401-1406 (1973); and Akerstrom et
al., J. Immunol., 135:2589-2542 (1985)).

[137] Throughout the assays, incubation and/or washing steps may be
required after each combination of reagents. Incubation steps can vary from about 5 seconds
to several hours, preferably from about 5 minutes to about 24 hours. The incubation time will
depend upon the assay format, analyte, volume of solution, concentrations, and the like.
Usually, the assays will be carried out at ambient temperature, although they can be
conducted over a range of temperatures, such as 10°C to 40°C.

1. Non-Competitive Assay Formats

[138] Immunoassays for detecting proteins or analytes of interest from tissue
samples may be either competitive or noncompetitive. Noncompetitive immunoassays are
assays in which the amount of captured protein or analyte is directly measured. In one
preferred "sandwich" assay, for example, the capture agent (e.g., antibodies specific for the
polypeptides of the invention) can be bound directly to a solid substrate where it is
immobilized. These immobilized antibodies then capture the polypeptide present in the test
sample. The polypeptide of the invention thus immobilized is then bound by a labeling agent,
such as a second labeled antibody specific for the polypeptide. Alternatively, the second
antibody may lack a label, but it may, in turn, be bound by a labeled third antibody specific to
antibodies of the species from which the second antibody is derived. The second antibody can be
modified with a detectable moiety, such as biotin, to which a third labeled molecule can
specifically bind, such as enzyme-labeled streptavidin.

2. Competitive Assay Formats 

[139] In competitive assays, the amount of protein or analyte present in the
sample is measured indirectly by measuring the amount of an added (exogenous) protein or
analyte displaced (or competed away) from a specific capture agent (e.g., antibodies specific
for a polypeptide of the invention) by the protein or analyte present in the sample. The
amount of immunogen bound to the antibody is inversely proportional to the concentration of
immunogen present in the sample. In a particularly preferred embodiment, the antibody is
immobilized on a solid substrate. The amount of analyte may be detected by providing a
labeled analyte molecule. It is understood that labels can include, e.g., radioactive labels as
well as peptide or other tags that can be recognized by detection reagents such as antibodies.

[140] Immunoassays in the competitive binding format can be used for
cross-reactivity determinations. For example, the protein encoded by the sequences described
herein can be immobilized on a solid support. Proteins are added to the assay and compete
with the binding of the antisera to the immobilized antigen. The ability of the above proteins
to compete with the binding of the antisera to the immobilized protein is compared to that of
the protein encoded by any of the sequences described herein. The percent cross-reactivity
for the above proteins is calculated, using standard calculations. Those antisera with less than
10% cross-reactivity with each of the proteins listed above are selected and pooled. The
cross-reacting antibodies are optionally removed from the pooled antisera by
immunoabsorption with the considered proteins, e.g., distantly related homologs.
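
The selection step above can be sketched as follows. The text refers only to "standard calculations"; this sketch assumes one common convention, in which percent cross-reactivity is the ratio of the amount of immunogen protein to the amount of competitor protein needed for 50% inhibition, multiplied by 100. The function names, the panel layout, and the numerical values are hypothetical.

```python
def percent_cross_reactivity(ic50_immunogen, ic50_competitor):
    """Assumed convention: 100 * (immunogen IC50 / competitor IC50)."""
    if ic50_immunogen <= 0 or ic50_competitor <= 0:
        raise ValueError("IC50 values must be positive")
    return 100.0 * ic50_immunogen / ic50_competitor


def select_antisera(panel, threshold_percent=10.0):
    """Keep antisera whose cross-reactivity is below the threshold for every competitor.

    panel: dict of antiserum name -> (immunogen IC50, {competitor name: IC50})
    """
    selected = []
    for name, (ic50_immunogen, competitors) in panel.items():
        if all(
            percent_cross_reactivity(ic50_immunogen, ic50) < threshold_percent
            for ic50 in competitors.values()
        ):
            selected.append(name)
    return selected


# Hypothetical panel; IC50 values are in arbitrary but consistent units.
panel = {
    "serum1": (1.0, {"homologA": 50.0, "homologB": 20.0}),  # 2% and 5%
    "serum2": (1.0, {"homologA": 5.0}),                     # 20%
}
pooled = select_antisera(panel)
```

Only `serum1` stays below 10% for every competitor, so it alone would be pooled under these assumed values.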

[141] The immunoabsorbed and pooled antisera are then used in a
competitive binding immunoassay as described above to compare a second protein, thought 
to be perhaps a protein of the present invention, to the immunogen protein. In order to make 
this comparison, the two proteins are each assayed at a wide range of concentrations and the 
amount of each protein required to inhibit 50% of the binding of the antisera to the 
immobilized protein is determined. If the amount of the second protein required is less than 
10 times the amount of the protein partially encoded by a sequence herein that is required, 
then the second protein is said to specifically bind to an antibody generated to an immunogen 
consisting of the target protein. 
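
The comparison described above can be sketched in code. The sketch assumes that each protein has been assayed at ascending concentrations, that the percentage of antiserum binding falls as concentration rises, and that the 50% point is estimated by linear interpolation on a log10 concentration axis. The interpolation method, the function names, and the data are illustrative assumptions; only the 50% inhibition point and the 10-fold criterion come from the text.

```python
import math


def ic50(concentrations, percent_bound):
    """Estimate the concentration at which binding drops to 50%.

    Assumes ascending, positive concentrations and a falling binding curve.
    """
    points = list(zip(concentrations, percent_bound))
    for (c_lo, b_lo), (c_hi, b_hi) in zip(points, points[1:]):
        if b_lo >= 50.0 >= b_hi and b_lo != b_hi:
            fraction = (b_lo - 50.0) / (b_lo - b_hi)
            log_c = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    raise ValueError("binding curve does not cross 50%")


def specifically_binds(ic50_second, ic50_reference, max_fold=10.0):
    """True if the second protein needs less than `max_fold` times the reference amount."""
    return ic50_second < max_fold * ic50_reference


# Hypothetical competition curves (concentration units are arbitrary but shared).
reference_ic50 = ic50([1.0, 10.0, 100.0], [90.0, 50.0, 10.0])
second_ic50 = ic50([1.0, 10.0, 100.0, 1000.0], [98.0, 85.0, 60.0, 40.0])
result = specifically_binds(second_ic50, reference_ic50)
```

With these assumed curves the reference protein reaches 50% inhibition near 10 units and the second protein near 316 units, which is more than 10 times the reference amount, so `result` is `False`.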

3. Other Assay Formats 

[142] In some embodiments, western blot (immunoblot) analysis is used to
detect and quantify the presence of a polypeptide of the invention in the sample. The 
technique generally comprises separating sample proteins by gel electrophoresis on the basis 
of molecular weight, transferring the separated proteins to a suitable solid support (such as, 
e.g., a nitrocellulose filter, a nylon filter, or a derivatized nylon filter) and incubating the 
sample with the antibodies that specifically bind the protein of interest. For example, 
antibodies are selected that specifically bind to the polypeptides of the invention on the solid 
support. These antibodies may be directly labeled or alternatively may be subsequently 
detected using labeled antibodies (e.g., labeled sheep anti-mouse antibodies) that specifically 
bind to the antibodies against the protein of interest. 

[143] Other assay formats include liposome immunoassays (LIA), which use 
liposomes designed to bind specific molecules (e.g., antibodies) and release encapsulated 
reagents or markers. The released chemicals are then detected according to standard 
techniques (see, Monroe et al. (1986) Amer. Clin. Prod. Rev. 5:34-41).

4. Labels

[144] The particular label or detectable group used in the assay is not a
critical aspect of the invention, as long as it does not significantly interfere with the specific
binding of the antibody used in the assay. The detectable group can be any material having a
detectable physical or chemical property. Such detectable labels have been well-developed in
the field of immunoassays and, in general, most labels useful in such methods can be applied
to the present invention. Thus, a label is any composition detectable by spectroscopic,
photochemical, biochemical, immunochemical, electrical, optical or chemical means. Useful
labels in the present invention include magnetic beads (e.g., Dynabeads™), fluorescent dyes
(e.g., fluorescein isothiocyanate, Texas red, rhodamine, and the like), radiolabels (e.g., 3H,
125I, 35S, 14C, or 32P), enzymes (e.g., horseradish peroxidase, alkaline phosphatase and others
commonly used in an ELISA), and colorimetric labels such as colloidal gold or colored glass
or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads.

[145] The label may be coupled directly or indirectly to the desired
component of the assay according to methods well known in the art. As indicated above, a
wide variety of labels may be used, with the choice of label depending on the sensitivity
required, the ease of conjugation with the compound, stability requirements, available
instrumentation, and disposal provisions.

[146] Non-radioactive labels are often attached by indirect means. The 
molecules can also be conjugated directly to signal generating compounds, e.g., by 
conjugation with an enzyme or fluorescent compound. A variety of enzymes and fluorescent 
compounds can be used with the methods of the present invention and are well-known to
those of skill in the art (for a review of various labeling or signal producing systems which
may be used, see, e.g., U.S. Patent No. 4,391,904).

[147] Means of detecting labels are well known to those of skill in the art.
Thus, for example, where the label is a radioactive label, means for detection include a
scintillation counter or photographic film as in autoradiography. Where the label is a
fluorescent label, it may be detected by exciting the fluorochrome with the appropriate
wavelength of light and detecting the resulting fluorescence. The fluorescence may be
detected visually, by means of photographic film, by the use of electronic detectors such as
charge coupled devices (CCDs) or photomultipliers and the like. Similarly, enzymatic labels
may be detected by providing the appropriate substrates for the enzyme and detecting the
resulting reaction product. Finally, simple colorimetric labels may be detected directly by
observing the color associated with the label. Thus, in various dipstick assays, conjugated
gold often appears pink, while various conjugated beads appear the color of the bead.

[148] Some assay formats do not require the use of labeled components. For
instance, agglutination assays can be used to detect the presence of the target antibodies. In
this case, antigen-coated particles are agglutinated by samples comprising the target
antibodies. In this format, none of the components need to be labeled and the presence of the 
target antibody is detected by simple visual inspection. 

VI. IDENTIFICATION OF MODULATORS OF POLYPEPTIDES OF THE 
INVENTION 

[149] Modulators of a polypeptide of the invention, i.e., agonists or
antagonists of a polypeptide's activity, or polypeptide's or polynucleotide's expression or 
full-length polypeptides of the invention or fragments thereof, are useful for treating a 
number of human diseases, including diabetes or obesity. For example, administration of 
modulators can be used to treat diabetic patients or prediabetic individuals to prevent 
progression, and therefore symptoms, associated with diabetes (including insulin resistance). 
Modulators of the invention can also be used to reduce obesity as well as the various diseases 
associated with obesity (e.g., gallbladder disease, cancer, sleep apnea, atherosclerosis, 
diabetes, and hypertension). In some cases, the modulators of the invention are used to 
regulate body physiology to reduce the chance of obesity-related diseases. For example, the 
modulators can be used to regulate serum lipids (total cholesterol, low-density lipoprotein 
(LDL), cholesterol, LDL/high density lipoprotein ratio and triglycerides). 

A. Agents that Modulate Polypeptides of the Invention 

[150] The agents tested as modulators of polypeptides of the invention can
be any small chemical compound, or a biological entity, such as a protein, sugar, nucleic acid
or lipid. Essentially any chemical compound can be used as a potential modulator or ligand
in the assays of the invention, although most often compounds that can be dissolved in
aqueous or organic (especially DMSO-based) solutions are used. Modulators include agents
designed to reduce the level of mRNA encoding a polypeptide of the invention (e.g., antisense
molecules, ribozymes, DNAzymes, small inhibitory RNAs and the like) or the level of
translation from an mRNA (e.g., translation blockers such as antisense molecules that are
complementary to translation start or other sequences on an mRNA molecule). Modulators
of the invention also include antibodies that specifically bind to and/or inhibit or activate the
polypeptides of the invention. Other modulators include the polypeptides of the invention
themselves, fragments thereof, or fusion proteins comprising the polypeptides or fragments
thereof (e.g., in some embodiments, comprising at least 25, 50, or 100 amino acids of the
polypeptide). For polypeptides of the invention that are receptors, soluble fragments of the
polypeptides (i.e., lacking a transmembrane domain) can act as modulators of polypeptide
signaling activity. For polypeptides of the invention that are secreted, both full length and
fragments with biological activity can act as modulators. It will be appreciated that there are
many suppliers of chemical compounds, including Sigma (St. Louis, MO), Aldrich (St. Louis,
MO), Sigma-Aldrich (St. Louis, MO), Fluka Chemika-Biochemica Analytika (Buchs,
Switzerland) and the like.

[151] In some embodiments, high throughput screening methods involve 
providing a combinatorial chemical or peptide library containing a large number of potential 
therapeutic compounds (potential modulator compounds). Such "combinatorial chemical
libraries" or "ligand libraries" are then screened in one or more assays, as described herein, to
identify those library members (particular chemical species or subclasses) that display a
desired characteristic activity. The compounds thus identified can serve as conventional
"lead compounds" or can themselves be used as potential or actual therapeutics.

[152] A combinatorial chemical library is a collection of diverse chemical 
compounds generated by either chemical synthesis or biological synthesis, by combining a
number of chemical "building blocks" such as reagents. For example, a linear combinatorial
chemical library such as a polypeptide library is formed by combining a set of chemical
building blocks (amino acids) in every possible way for a given compound length (i.e., the
number of amino acids in a polypeptide compound). Millions of chemical compounds can be 
synthesized through such combinatorial mixing of chemical building blocks. 
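
The size of such a linear library follows directly from the combinatorics: with b building blocks and a compound length of n, there are b^n possible sequences. The sketch below illustrates this for peptides, assuming the 20 standard amino acids as the building-block set; the names `library_size` and `enumerate_library` are hypothetical.

```python
from itertools import product

# One-letter codes for the 20 standard amino acids (assumed building-block set).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(length, n_blocks=len(AMINO_ACIDS)):
    """Number of distinct linear compounds of the given length."""
    return n_blocks ** length


def enumerate_library(length, blocks=AMINO_ACIDS):
    """Lazily yield every sequence of the given length, one at a time."""
    for combination in product(blocks, repeat=length):
        yield "".join(combination)


dipeptides = list(enumerate_library(2))
pentapeptide_count = library_size(5)
```

Under this assumption there are 400 dipeptides and 3,200,000 pentapeptides, which is consistent with the statement that millions of compounds can arise from a modest set of building blocks. The generator avoids holding a large library in memory at once.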

[153] Preparation and screening of combinatorial chemical libraries is well
known to those of skill in the art. Such combinatorial chemical libraries include, but are not
limited to, peptide libraries (see, e.g., U.S. Patent 5,010,175, Furka, Int. J. Pept. Prot. Res.
37:487-493 (1991) and Houghton et al., Nature 354:84-88 (1991)). Other chemistries for
generating chemical diversity libraries can also be used. Such chemistries include, but are
not limited to: peptoids (e.g., PCT Publication No. WO 91/19735), encoded peptides (e.g.,
PCT Publication WO 93/20242), random bio-oligomers (e.g., PCT Publication No. WO
92/00091), benzodiazepines (e.g., U.S. Pat. No. 5,288,514), diversomers such as hydantoins,
benzodiazepines and dipeptides (Hobbs et al., Proc. Nat. Acad. Sci. USA 90:6909-6913
(1993)), vinylogous polypeptides (Hagihara et al., J. Amer. Chem. Soc. 114:6568 (1992)),
nonpeptidal peptidomimetics with glucose scaffolding (Hirschmann et al., J. Amer. Chem.
Soc. 114:9217-9218 (1992)), analogous organic syntheses of small compound libraries (Chen
et al., J. Amer. Chem. Soc. 116:2661 (1994)), oligocarbamates (Cho et al., Science 261:1303
(1993)), and/or peptidyl phosphonates (Campbell et al., J. Org. Chem. 59:658 (1994)),
nucleic acid libraries (see Ausubel, Berger and Sambrook, all supra), peptide nucleic acid
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn et al., Nature
Biotechnology, 14(3):309-314 (1996) and PCT/US96/10287), carbohydrate libraries (see, e.g.,
Liang et al., Science, 274:1520-1522 (1996) and U.S. Patent 5,593,853), small organic
molecule libraries (see, e.g., benzodiazepines, Baum C&EN, Jan 18, page 33 (1993);
isoprenoids, U.S. Patent 5,569,588; thiazolidinones and metathiazanones, U.S. Patent
5,549,974; pyrrolidines, U.S. Patents 5,525,735 and 5,519,134; morpholino compounds, U.S.
Patent 5,506,337; benzodiazepines, 5,288,514, and the like).

[154] Devices for the preparation of combinatorial libraries are commercially 
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). In addition, numerous combinatorial libraries are themselves commercially 
available (see, e.g., ComGenex, Princeton, N.J., Tripos, Inc., St. Louis, MO, 3D 
Pharmaceuticals, Exton, PA, Martek Biosciences, Columbia, MD, etc.). 

B. Methods of Screening for Modulators of the Polypeptides of the Invention

[155] A number of different screening protocols can be utilized to identify 
agents that modulate the level of expression or activity of a polynucleotide or a polypeptide
of the invention in cells, particularly mammalian cells, and especially human cells. In 
general terms, the screening methods involve screening a plurality of agents to identify an 
agent that modulates the activity of a polypeptide of the invention by, e.g., binding to the 
polypeptide, preventing an inhibitor or activator from binding to the polypeptide, increasing 
association of an inhibitor or activator with the polypeptide, or activating or inhibiting 
expression of the polypeptide. The assays can be designed to screen large chemical libraries 
by automating the assay steps and providing compounds from any convenient source to 
assays, which are typically run in parallel (e.g., in microtiter formats on microtiter plates in
robotic assays). 

[156] Any cell expressing a full-length polypeptide of the invention or a
fragment thereof can be used to identify modulators. In some embodiments, the cells are
eukaryotic cell lines (e.g., CHO or HEK293) transformed to express a heterologous
polypeptide of the invention. In some embodiments, a cell expressing an endogenous
polypeptide of the invention is used in screens. In other embodiments, modulators are
screened for their ability to affect insulin responses. In other embodiments, modulators are
screened for their ability to affect body weight (as measured by BMI or waist-to-hip ratio)
and secretion of a variety of obesity markers (e.g., leptin, IL-6 or TNF alpha). In other
embodiments, modulators are screened for their ability to affect lipid metabolism. In other
embodiments, modulators are screened for their ability to affect the secretion and activity of
adipogenic factors.

[157] In some embodiments, modulators of ADLICAN comprising the
amino acid sequence of SEQ ID NO: 2, 4, or 6, may be identified using, e.g., modulator
binding assays, expression assays or promoter-reporter assays.

[158] In some embodiments, modulators of ALDH1A3 comprising the
amino acid sequence of SEQ ID NO: 8, 10, or 12, may be identified using, e.g., modulator
binding assays, expression assays, promoter-reporter assays, or assays based on the retinoic
acid production. 

[159] In some embodiments, modulators of ALK7 comprising the amino acid
sequence of SEQ ID NO: 14, 16, or 18, may be identified using, e.g., expression assays,
promoter-reporter assays, modulator binding assays or kinase assays based on Smad2
phosphorylation. Kinase assays can be carried out after contacting either purified
recombinant ALK7 protein, or an intact cell with the modulator. Modulators which bind to
the ALK7 can be screened by a ligand binding assay method using, e.g., nodal as the ligand.

[160] In some embodiments, modulators of C3AR1 comprising the amino
acid sequence of SEQ ID NO: 20, 22, or 24, may be identified using, e.g., the expression
assays, promoter-reporter assays, binding assays, or screening methods that monitor
modulator-induced fluctuation of intracellular Ca2+ concentration. Modulators which bind to
the C3AR1 can be screened by a ligand binding assay method using, e.g., complement
anaphylatoxin C3a as the ligand.

[161] In some embodiments, modulators of CALCRL comprising the amino
acid sequence of SEQ ID NO: 26, 28, or 30, may be identified using, e.g., expression assays,
promoter-reporter assays, modulator binding assays, or screening methods that monitor
modulator-induced fluctuation of intracellular cAMP concentration. Modulators which bind
to the CALCRL can be screened by a ligand binding assay method using, e.g.,
adrenomedullin or calcitonin gene related peptide as ligands.

[162] In some embodiments, modulators of CCL13 comprising the amino
acid sequence of SEQ ID NO: 32, 33, 35, or 37, may be identified using, e.g., expression
assays, promoter-reporter assays, modulator binding assays, or screening methods that
monitor modulator-induced fluctuation of intracellular Ca2+ concentration. Modulators
which bind to the CCL13 can be screened by a ligand binding assay method using, e.g.,
CCR1 or other C-C G-protein coupled receptors known to bind to CCL13.

[163] In some embodiments, modulators of CCL8 comprising the amino acid 
sequence of SEQ ID NO: 39, 40, 42, or 44, may be identified using, e.g., expression assays, 
promoter-reporter assays, modulator binding assays, or screening methods that monitor 
modulator-induced fluctuation of intracellular Ca2+ concentration. Modulators which bind to
the CCL8 can be screened by a ligand binding assay method using, e.g., CCR1 or other C-C 
G-protein coupled receptors known to bind to CCL8. 

[164] In some embodiments, modulators of CHI3L1 comprising the amino
acid sequence of SEQ ID NO: 46, 47, 49, or 51, may be identified using, e.g., expression
assays, promoter-reporter assays, modulator binding assays, or screening methods that
monitor modulator-induced fluctuation of phosphorylation and activity of MAPK and/or
AKT (see, e.g., Recklies, A.D. et al., Biochem J. 365:119-26 (2002)).

[165] In some embodiments, modulators of CR1 comprising the amino acid 
sequence of SEQ ID NO: 53 or 55, may be identified using, e.g., expression assays, 
promoter-reporter assays, or modulator binding assays. Modulators which bind to the CR1 
can be screened by a ligand binding assay method using, e.g., complement component C3b as 
ligand. 

[166] In some embodiments, modulators of CSFR1 comprising the amino 
acid sequence of SEQ ID NO: 57, 59 or 61, may be identified using, e.g., expression assays, 
promoter-reporter assays, modulator binding assays or activity assays. Kinase assays can be 
carried out after contacting either purified recombinant CSFR1 protein or an intact cell with 
modulators. Modulators which bind to the CSFR1 can be screened by a ligand binding assay 
method using, e.g., colony stimulating factor as ligand.

[167] In some embodiments, modulators of CTSK comprising the amino acid
sequence of SEQ ID NO: 63, 64, 66 or 68, may be identified using, e.g., expression assays,
promoter-reporter assays, modulator binding assays, or enzyme assays. Enzyme assays can
be carried out after contacting either purified recombinant CTSK protein, or an intact cell
with a modulator using, e.g., fibrinogen as a substrate.

[168] In some embodiments, modulators of CXCR4 comprising the amino 
acid sequence of SEQ ID NO: 70, 72 or 74, may be identified using, e.g., expression assays, 
promoter-reporter assays, modulator binding assays, or screening methods that monitor 
modulator-induced fluctuation of intracellular Ca2+ concentration or phosphorylation and
activity of MAPK and/or AKT. Modulators which bind to the CXCR4 can be screened by a
ligand binding assay method using, e.g., CXCL12 as a ligand.

[169] In some embodiments, modulators of DDAH2 comprising the amino
acid sequence of SEQ ID NO: 76, 78 or 80, may be identified using, e.g., expression assays,
promoter-reporter assays, modulator binding assays, or activity assays. Modulators which
affect DDAH2 activity can be screened by measuring the conversion of ADMA to citrulline
and methylamines.

[170] In some embodiments, modulators of DERP7 comprising the amino
acid sequence of SEQ ID NO: 82, 84 or 86, may be identified using, e.g., expression assays,
modulator binding assays, or promoter-reporter assays.

[171] In some embodiments, modulators of ENDOGLYX1 comprising the 
amino acid sequence of SEQ ID NO: 88, 90 or 92, may be identified using, e.g., expression 
assays, modulator binding assays, promoter-reporter assays, or activity assays based on angiogenesis
(see, e.g., Christian, S. et al., J. Biol. Chem. 276: 48588-48595 (2001)).

[172] In some embodiments, modulators of ETL comprising the amino acid
sequence of SEQ ID NO: 94, 96 or 98, may be identified using, e.g., expression assays,
promoter-reporter assays, modulator binding assays, or assays based on a G-protein coupled
receptor activity.

[173] In some embodiments, modulators of FLJ12389 comprising the amino
acid sequence of SEQ ID NO: 100, 102, 104, 106 or 108, may be identified using, e.g.,
expression assays, promoter-reporter assays, modulator binding assays, or assays based on an
AMP binding or an acetoacetate-CoA ligase activity.

[174] In some embodiments, modulators of FZD4 comprising the amino acid
sequence of SEQ ID NO: 110, 112, 114 or 116, may be identified using, e.g., expression
assays, promoter-reporter assays or modulator binding assays. Modulators which bind to the
FZD4 can be screened by a ligand binding assay method using, e.g., norrin as a ligand.

[175] In some embodiments, modulators of GLIPR1 comprising the amino
acid sequence of SEQ ID NO: 118, 120 or 122, may be identified using, e.g., expression
assays, promoter-reporter assays, modulator binding assays, or activity assays based on an
induction of apoptosis.

[176] In some embodiments, modulators of GPR105 comprising the amino
acid sequence of SEQ ID NO: 124, 126 or 128, may be identified using, e.g., expression
assays, promoter-reporter assays, modulator binding assays, or screening methods that
monitor modulator-induced fluctuation of intracellular Ca2+ concentration. Modulators
which bind to the GPR105 can be screened by a ligand binding assay method using, e.g.,
UDP-glucose as ligand.

[177] In some embodiments, modulators of GPR146 comprising the amino 
acid sequence of SEQ ID NO: 130, 132 or 134, may be identified using, e.g., expression
assays, promoter-reporter assays, modulator binding assays, or assays based on a G-protein 
coupled receptor activity. 

[178] In some embodiments, modulators of GPR30 comprising the amino
acid sequence of SEQ ID NO: 136, 138 or 140, may be identified using, e.g., expression 
assays, promoter-reporter assays, modulator binding assays, or screening methods that 
monitor modulator-induced fluctuation of intracellular cAMP concentration or 
phosphorylation and activity of MAPK. 

[179] In some embodiments, modulators of GPR65 comprising the amino 
acid sequence of SEQ ID NO: 142, 144 or 146, may be identified using, e.g., expression 
assays, promoter-reporter assays or modulator binding assays. Modulators which bind to the 
GPR65 can be screened by a ligand binding assay method using, e.g., psychosine as ligand. 

[180] In some embodiments, modulators of HTR2B comprising the amino
acid sequence of SEQ ID NO: 148, 150 or 152, may be identified using, e.g., expression
assays, promoter-reporter assays or modulator binding assays. Modulators which bind to the 
HTR2B can be screened by a ligand binding assay method using, e.g., serotonin as ligand. 
Assays detecting phosphoinositide phospholipase C activity can be used. 

[181] In some embodiments, modulators of ITGB2 comprising the amino
acid sequence of SEQ ID NO: 154, 156 or 158, may be identified using, e.g., expression
assays, promoter-reporter assays or modulator binding assays. Modulators which bind to the
ITGB2 can be screened by a ligand binding assay method using, e.g., ITG alpha chain protein.

[182] In some embodiments, modulators of ITIH5 comprising the amino acid
sequence of SEQ ID NO: 160, 161 or 163, may be identified using, e.g., expression assays, 
promoter-reporter assays, or modulator binding assays. 

[183] In some embodiments, modulators of LGALS12 comprising the amino
acid sequence of SEQ ID NO: 165, 167, 169, 171, 173, 175, 177 or 179, may be identified 
using, e.g., expression assays, promoter-reporter assays, modulator binding assays, or 
screening methods that monitor modulator-induced fluctuation in apoptosis. 

[184] In some embodiments, modulators of NMB comprising the amino acid 
sequence of SEQ ID NO: 181, 182, 184 or 186, may be identified using, e.g., expression 
assays, promoter-reporter assays, modulator binding assays, or screening methods that monitor
modulator-induced fluctuation of intracellular Ca2+ concentration.
Modulators which bind to the NMB can be screened by a ligand binding assay method using,
e.g., NMBR.

[185] In some embodiments, modulators of NNAT comprising the amino
acid sequence of SEQ ID NO: 188, 190, 192 or 194, may be identified using, e.g., expression
assays, promoter-reporter assays, or modulator binding assays.

[186] In some embodiments, modulators of OLFM2 comprising the amino
acid sequence of SEQ ID NO: 196, 197, 199 or 201, may be identified using, e.g., expression
assays, promoter-reporter assays, or modulator binding assays.

[187] In some embodiments, modulators of OPN3 comprising the amino acid
sequence of SEQ ID NO: 203, 205, 207, 209 or 211, may be identified using, e.g., expression
assays, promoter-reporter assays, or modulator binding assays.

[188] In some embodiments, modulators of PTPRE comprising the amino 
acid sequence of SEQ ID NO: 213, 215, 217, 219 or 221, may be identified using, e.g., 
expression assays, promoter-reporter assays, modulator binding assays, or screening assays
based on a receptor protein tyrosine phosphatase activity or phosphorylation and activity of 
MAPK. 

[189] In some embodiments, modulators of RDC1 comprising the amino acid 
sequence of SEQ ID NO: 223, 225 or 227, may be identified using, e.g., expression assays, 
promoter-reporter assays, or modulator binding assays.

[190] In some embodiments, modulators of SLIT2 comprising the amino 
acid sequence of SEQ ID NO: 229, 230, 232 or 234, may be identified using, e.g., expression 
assays, promoter-reporter assays or modulator binding assays. Modulators which bind to the 
SLIT2 can be screened by a ligand binding assay method using, e.g., roundabout receptor 
ROBO1.

[191] In some embodiments, modulators of TNFRSF21 comprising the 
amino acid sequence of SEQ ID NO: 236, 238 or 240, may be identified using, e.g., 
expression assays, promoter-reporter assays, modulator binding assays, or screening methods 
that monitor modulator-induced fluctuation in apoptosis and activation of both NF-kappaB 
and JNK.

[192] In some embodiments, modulators of TNFSF13B comprising the 
amino acid sequence of SEQ ID NO: 242, 243, 245, 247 or 249, may be identified using, e.g., 
expression assays, promoter-reporter assays, modulator binding assays based on a receptor
activity using, e.g., TNFRSF13B, TNFRSF13C or TNFRSF17, or screening methods that
monitor modulator-induced fluctuation in activation of NF-kappaB.

[193] In some embodiments, modulators of TNFSF14 comprising the amino 
acid sequence of SEQ ID NO: 251, 252, 254, 256, 258 or 260, may be identified using, e.g.,
expression assays, promoter-reporter assays, or modulator binding assays based on a receptor
activity using, e.g., TNFRSF14.

[194] In some embodiments, modulators of TPSB2 comprising the amino
acid sequence of SEQ ID NO: 262, 263, 265 or 267, may be identified using, e.g., expression
assays, promoter-reporter assays, modulator binding assays, or screening methods that
monitor modulator-induced fluctuation in serine-type peptidase activity.

[195] In some embodiments, modulators of WISP2 comprising the amino
acid sequence of SEQ ID NO: 269, 270, 272 or 274, may be identified using, e.g., expression 
assays, promoter-reporter assays, modulator binding assays, or screening methods that 
monitor modulator-induced fluctuation in proliferative rate of vascular smooth muscle cells. 

1. Polypeptide Binding Assays 

[196] Preliminary screens can be conducted by screening for agents capable 
of binding to polypeptides of the invention, as at least some of the agents so identified are 
likely modulators of a polypeptide of the invention. Binding assays are also useful, e.g., for
identifying endogenous proteins that interact with polypeptides of the invention. For
example, antibodies, receptors or other molecules that bind polypeptides of the invention can
be identified in binding assays. 

[197] Binding assays usually involve contacting a polypeptide of the
invention with one or more test agents and allowing sufficient time for the protein and test
agents to form a binding complex. Any binding complexes formed can be detected using any
of a number of established analytical techniques. Protein binding assays include, but are not
limited to, methods that measure co-precipitation or co-migration on non-denaturing SDS-
polyacrylamide gels, and co-migration on Western blots (see, e.g., Bennet, J.P. and
Yamamura, H.I. (1985) "Neurotransmitter, Hormone or Drug Receptor Binding Methods," in
Neurotransmitter Receptor Binding (Yamamura, H.I., et al., eds.), pp. 61-89). Other binding
assays involve the use of mass spectrometry or NMR techniques to identify molecules bound
to a polypeptide of the invention or displacement of labeled substrates. The polypeptides of
the invention utilized in such assays can be naturally expressed, cloned or synthesized.

[198] In addition, mammalian or yeast two-hybrid approaches (see, e.g.,
Bartel, P.L. et al., Methods Enzymol. 254:241 (1995)) can be used to identify polypeptides or
other molecules that interact or bind when expressed together in a host cell.



2. Polypeptide Activity

[199] The activity of polypeptides of the invention can be assessed using a
variety of in vitro and in vivo assays to determine functional, chemical, and physical effects,
e.g., measuring ligand binding (e.g., radioactive or otherwise labeled ligand binding), second
messengers (e.g., cAMP, cGMP, IP3, DAG, or Ca2+), ion flux, phosphorylation levels,
transcription levels, and the like. Measurement of such functional, chemical and/or physical
effects may be direct (e.g., directly detecting calcium flux) or indirect (e.g., detecting changes
in expression or activity of gene products that are known to be modulated by the effects such
as calcium flux or others listed above). Furthermore, such assays can be used to test for
inhibitors and activators of the polypeptides of the invention. Modulators can also be
genetically altered versions of polypeptides of the invention.

[200] The polypeptide of the assay will be selected from a polypeptide with
substantial identity to a sequence of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26,
28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72,
74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116,
118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154,
156, 158, 160, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190,
192, 194, 196, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227,
229, 230, 232, 234, 236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262,
263, 265, 267, 269, 270, 272, 274 or other conservatively modified variants thereof.
Generally, the amino acid sequence identity will be at least 70%, optionally at least 85%,
optionally at least 90%, or optionally at least 95% to the polypeptides exemplified herein.
Optionally, the polypeptide of the assays will comprise a fragment of a polypeptide of the
invention, such as an extracellular domain, transmembrane domain, cytoplasmic domain,
ligand binding domain, subunit association domain, active site, and the like. Either a
polypeptide of the invention or a domain thereof can be covalently linked to a heterologous
protein to create a chimeric protein used in the assays described herein.
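
The identity thresholds recited above (at least 70%, 85%, 90% or 95%) can be illustrated with a minimal Python sketch. It assumes the two sequences have already been aligned to equal length with "-" as the gap character; the function names, the gap convention and the toy sequences are hypothetical and are not taken from this specification, which does not prescribe a particular alignment method at this point.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns (assumed pre-aligned, '-' = gap).

    Columns in which both sequences carry a gap are ignored; a residue
    paired with a gap counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold_pct: float = 70.0) -> bool:
    """True if identity is at least the given threshold (70% by default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold_pct


if __name__ == "__main__":
    # Toy, hypothetical sequences; one mismatch in four columns.
    print(percent_identity("AAAA", "AAAT"))
    print(meets_identity_threshold("AAAA", "AAAT", 70.0))
```

In practice the aligned strings would come from a standard alignment tool; this sketch only shows how the percentage and the threshold test relate.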

[201] Modulators of polypeptide activity are tested using either recombinant
or naturally occurring polypeptides of the invention. The protein can be isolated, expressed
in a cell, expressed in a membrane derived from a cell, expressed in tissue or in an animal,
either recombinant or naturally occurring. For example, tissue slices, dissociated cells, e.g.,
from tissues expressing polypeptides of the invention, transformed cells, or membranes can
be used. Modulation is tested using one of the in vitro or in vivo assays described herein.

[202] Modulator binding to polypeptides of the invention, a domain, or 
chimeric protein can be tested in solution, in a bilayer membrane, attached to a solid phase, in 
a lipid monolayer, or in vesicles. Binding of a modulator can be tested using, e.g., changes in
spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index), hydrodynamic
(e.g., shape), chromatographic, or solubility properties. 

[203] Samples or assays that are treated with a potential modulator (e.g., a
"test compound") are compared to control samples without the test compound, to examine the
extent of modulation. Control samples (untreated with activators or inhibitors) are assigned a
relative activity value of 100%. Inhibition of the polypeptides of the invention is achieved
when the activity value relative to the control is about 90%, optionally 50%, optionally
25-0%. Activation of the polypeptides of the invention is achieved when the activity value
relative to the control is 110%, optionally 150%, 200%, 300%, 400%, 500%, or 1000-2000%.
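
The comparison to control described above can be sketched in Python as follows. The 90% and 110% cutoffs come from the paragraph above; treating them as inclusive bounds, and the function names and example signal values, are assumptions made only for illustration.

```python
def relative_activity(treated_signal: float, control_signal: float) -> float:
    """Activity of a treated sample as a percentage of the untreated control.

    The control is assigned a relative activity value of 100.
    """
    if control_signal <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * treated_signal / control_signal


def classify_modulation(relative_pct: float,
                        inhibit_at: float = 90.0,
                        activate_at: float = 110.0) -> str:
    """Label a sample using the about-90% and 110% cutoffs (assumed inclusive)."""
    if relative_pct <= inhibit_at:
        return "inhibitor"
    if relative_pct >= activate_at:
        return "activator"
    return "no effect"


if __name__ == "__main__":
    # Hypothetical raw signals: control well reads 90, treated well reads 45.
    pct = relative_activity(45.0, 90.0)
    print(pct, classify_modulation(pct))
```

Stricter optional cutoffs (e.g., 50% for inhibition or 150% for activation) can be applied by passing different `inhibit_at` and `activate_at` values.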

3. Expression Assays 

[204] Screening for a compound that modulates the expression of a
polynucleotide or a polypeptide of the invention is also provided. Screening methods 
generally involve conducting cell-based assays in which test compounds are contacted with 
one or more cells expressing a polynucleotide or a polypeptide of the invention, and then 
detecting an increase or decrease in expression (either transcript or translation product). 
Assays can be performed with any cells that express a polynucleotide or a polypeptide of the 
invention. 

[205] Expression can be detected in a number of different ways. As
described infra, the expression level of a polynucleotide of the invention in a cell can be
determined by probing the mRNA expressed in a cell with a probe that specifically hybridizes
with a transcript (or complementary nucleic acid derived therefrom) of a polynucleotide of
the invention. Probing can be conducted by lysing the cells and conducting Northern blots or
without lysing the cells using in situ hybridization techniques. Alternatively, a polypeptide
of the invention can be detected using immunological methods in which a cell lysate is 
probed with antibodies that specifically bind to the polypeptide. 

[206] Promoter-reporter assays can be carried out using mammalian cells
transfected with a reporter gene operably linked to sequences derived from the promoter
regions of genes encoding the polypeptides of the invention. The increased or decreased
expression of the reporter gene can be detected in the presence and absence of the modulator.
Expression of reporter genes may be detected by hybridization to a complementary nucleic
acid, by using an immunological reagent, by assaying for an activity of the reporter gene
product, or other methods known to those in the art.

[207] The level of expression or activity of a polynucleotide or a polypeptide 
of the invention can be compared to a baseline value. The baseline value can be a value for a 
control sample or a statistical value that is representative of expression levels of a 
polynucleotide or a polypeptide of the invention for a control population (e.g., lean
individuals as described herein) or cells (e.g., tissue culture cells not exposed to a modulator).
Expression levels can also be determined for cells that do not express the polynucleotide or a 
polypeptide of the invention as a negative control. Such cells generally are otherwise 
substantially genetically the same as the test cells. 
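
One way to picture the baseline comparison described above is the following Python sketch, which derives a statistical baseline from a control population and expresses a test value as a fold change and as a z-score. The choice of mean and sample standard deviation, the function names and the example values are assumptions for illustration; the specification does not fix a particular statistic.

```python
from statistics import mean, stdev


def baseline_from_controls(control_values):
    """Return (mean, sample standard deviation) of control expression levels."""
    if len(control_values) < 2:
        raise ValueError("need at least two control measurements")
    return mean(control_values), stdev(control_values)


def fold_change(test_value: float, baseline_mean: float) -> float:
    """Test expression divided by the baseline mean."""
    if baseline_mean <= 0:
        raise ValueError("baseline mean must be positive")
    return test_value / baseline_mean


def z_score(test_value: float, baseline_mean: float, baseline_sd: float) -> float:
    """Number of control standard deviations the test value lies from baseline."""
    if baseline_sd <= 0:
        raise ValueError("baseline standard deviation must be positive")
    return (test_value - baseline_mean) / baseline_sd


if __name__ == "__main__":
    # Hypothetical expression levels (arbitrary units) for a control population.
    base_mean, base_sd = baseline_from_controls([8.0, 10.0, 12.0])
    print(fold_change(20.0, base_mean), z_score(14.0, base_mean, base_sd))
```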

[208] A variety of different types of cells can be utilized in the reporter
assays. Cells that do not endogenously express a polypeptide of the invention can be
prokaryotic, but are preferably eukaryotic. The eukaryotic cells can be any of the cells
typically utilized in generating cells that harbor recombinant nucleic acid constructs.
Exemplary eukaryotic cells include, but are not limited to, yeast, and various higher
eukaryotic cells such as the HEK293, HepG2, COS, CHO and HeLa cell lines.

[209] Various controls can be conducted to ensure that an observed activity
is authentic, including running parallel reactions with cells that lack the reporter construct or
by not contacting a cell harboring the reporter construct with test compound. Compounds
can also be further validated as described below.

4. Validation

[210] Agents that are initially identified by any of the foregoing screening
methods can be further tested to validate the apparent activity. Alternatively, potential
modulators can be tested initially using the foregoing validation assays without preliminary
screening.

[211] Modulators that are selected for further study can be tested for anti-
diabetic effects using the "classic" insulin responsive cell line, mouse 3T3-L1 adipocytes,
muscle cells such as L6 cells and the like. Cells (e.g., adipocytes or muscle cells) are pre-
incubated with the modulators and tested for acute (up to 4 hours) and chronic (overnight)
effects on basal and insulin-stimulated GLUT4 translocation and glucose uptake.

[212] Modulators that are selected for further study can be tested for anti-
obesity effects using any adipocyte or adipogenic cell, e.g., mouse cell line 3T3-L1
adipocytes, freshly isolated rodent or human adipocytes, undifferentiated adipogenic cells and
the like. Cells (e.g., adipocytes) are pre-incubated with the modulators and tested for
acute (up to 4 hours) and chronic (overnight or longer) effects on basal and insulin-stimulated
release of adipogenic factors, adipocyte cell size, leptin and TNF alpha release, and/or lipid
metabolism. Undifferentiated adipogenic cells can be pre-incubated with the modulators and
tested for effects on differentiation into adipocytes (including changes in differentiation
markers) and/or triglyceride accumulation.

[213] The response of this increase in body weight can be determined at an
organismal, tissue or cellular level. For example, increased fasting blood leptin levels are
indicative of obesity. Other methods of measuring obesity include, e.g., calculation of BMI,
waist-to-hip ratio, total body fat, measuring the blood levels of a variety of secreted proteins
which have been shown to correlate to obesity (IL-6, TNF alpha) and measuring the fasted
blood levels of free fatty acids.
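
Two of the organismal measures named above are simple ratios, shown here as a minimal Python sketch. BMI is computed with the standard definition (weight in kilograms divided by the square of height in meters); the function names and the example measurements are hypothetical.

```python
def bmi(weight_kg: float, height_m: float) -> float:
    """Body mass index: weight (kg) divided by height (m) squared."""
    if weight_kg <= 0 or height_m <= 0:
        raise ValueError("weight and height must be positive")
    return weight_kg / (height_m ** 2)


def waist_to_hip_ratio(waist_cm: float, hip_cm: float) -> float:
    """Waist circumference divided by hip circumference (same units)."""
    if waist_cm <= 0 or hip_cm <= 0:
        raise ValueError("circumferences must be positive")
    return waist_cm / hip_cm


if __name__ == "__main__":
    # Hypothetical subject: 81 kg, 1.80 m, waist 90 cm, hip 100 cm.
    print(round(bmi(81.0, 1.80), 1), round(waist_to_hip_ratio(90.0, 100.0), 2))
```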

[214] Following such studies, validity of the modulators is tested in suitable 
animal models. The basic format of such methods involves administering a lead compound 
identified during an initial screen to an animal that serves as a model for humans and then 
determining if expression or activity of a polypeptide of the invention is in fact modulated.

[215] The effect of the compound will be assessed in obese animals,
diabetic animals or diet-induced insulin resistant animals. The body weight loss, blood
glucose and insulin levels will be determined. The animal models utilized in validation
studies generally are mammals of any kind. Specific examples of suitable animals include,
but are not limited to, primates, mice and rats. Monogenic models of diabetes (e.g., ob/ob
and db/db mice, Zucker rats and Zucker Diabetic Fatty rats, etc.) or polygenic models of
diabetes (e.g., OLETF rats, GK rats, NSY mice, and KK mice) can be useful for validating
modulation of a polypeptide of the invention in a diabetic or insulin resistant animal. In
addition, transgenic animals expressing human polypeptides of the invention can be used to
further validate drug candidates.

[216] Monogenic models of obesity (e.g., OLETF, tubby, mahogany, agouti,
ob/ob and db/db mice, etc.) or polygenic models of obesity (e.g., high fat diet-induced obese
animals, NZO mice, KK mice, Wellesley mice, GK rats, etc.) can be useful for validating
modulation of a polypeptide of the invention in an obese animal. The most widely used
criteria for assessing the efficacy of anti-obesity treatments are those from the FDA. The
FDA defines a body weight loss of >5% as statistically significant compared to placebo.
However, it will be appreciated that any detectable change in body weight following
administration of a modulator of the invention can be considered a relevant result.
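
The >5% criterion mentioned above can be pictured with the following Python sketch, which computes percent body weight loss and the difference between a treated and a placebo group. Treating the criterion as a simple placebo-subtracted difference of percent losses, and all names and values, are assumptions for illustration only and do not represent a regulatory calculation.

```python
def percent_weight_loss(baseline_weight: float, final_weight: float) -> float:
    """Weight lost as a percentage of baseline weight (negative means gain)."""
    if baseline_weight <= 0:
        raise ValueError("baseline weight must be positive")
    return 100.0 * (baseline_weight - final_weight) / baseline_weight


def placebo_adjusted_loss(treated_loss_pct: float, placebo_loss_pct: float) -> float:
    """Percent loss in the treated group minus percent loss in the placebo group."""
    return treated_loss_pct - placebo_loss_pct


def exceeds_criterion(adjusted_loss_pct: float, threshold_pct: float = 5.0) -> bool:
    """True if the placebo-adjusted loss is greater than the threshold (>5%)."""
    return adjusted_loss_pct > threshold_pct


if __name__ == "__main__":
    # Hypothetical group means: treated 100 -> 92, placebo 100 -> 99.
    treated = percent_weight_loss(100.0, 92.0)
    placebo = percent_weight_loss(100.0, 99.0)
    print(exceeds_criterion(placebo_adjusted_loss(treated, placebo)))
```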

C. Solid Phase and Soluble High Throughput Assays

[217] In the high throughput assays of the invention, it is possible to screen
up to several thousand different modulators or ligands in a single day. In particular, each
well of a microtiter plate can be used to run a separate assay against a selected potential
modulator, or, if concentration or incubation time effects are to be observed, every 5-10 wells
can test a single modulator. Thus, a single standard microtiter plate can assay about 100 (e.g.,
96) modulators. If 1536 well plates are used, then a single plate can easily assay from about
100 to about 1500 different compounds. It is possible to assay several different plates per
day; assay screens for up to about 6,000-20,000 or more different compounds are possible
using the integrated systems of the invention. In addition, microfluidic approaches to reagent
manipulation can be used.
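
The plate arithmetic in the paragraph above can be sketched in Python. The function and parameter names are hypothetical, and reserving wells for controls is an assumption added for illustration.

```python
def modulators_per_plate(wells: int, wells_per_modulator: int = 1,
                         control_wells: int = 0) -> int:
    """Number of modulators one plate can test.

    wells_per_modulator > 1 models concentration or incubation-time series
    (e.g., 5-10 wells per modulator); control_wells are set aside first.
    """
    if wells_per_modulator < 1:
        raise ValueError("wells_per_modulator must be at least 1")
    usable = wells - control_wells
    if usable < 0:
        raise ValueError("more control wells than wells on the plate")
    return usable // wells_per_modulator


def compounds_per_day(plates_per_day: int, wells: int,
                      wells_per_modulator: int = 1, control_wells: int = 0) -> int:
    """Daily throughput for a given number of identical plates."""
    return plates_per_day * modulators_per_plate(
        wells, wells_per_modulator, control_wells)


if __name__ == "__main__":
    print(modulators_per_plate(96))            # one well per modulator
    print(modulators_per_plate(96, 8))         # 8-point series per modulator
    print(compounds_per_day(4, 1536))          # four 1536-well plates per day
```

With four 1536-well plates per day and one well per compound, the sketch gives a figure at the lower end of the 6,000-20,000 range stated above.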

[218] A molecule of interest (e.g., a polypeptide or polynucleotide of the 
invention, or a modulator thereof) can be bound to the solid-state component, directly or 
indirectly, via covalent or non-covalent linkage, e.g., via a tag. The tag can be any of a 
variety of components. In general, a molecule that binds the tag (a tag binder) is fixed to a 
solid support, and the tagged molecule of interest is attached to the solid support by
interaction of the tag and the tag binder. 

[219] A number of tags and tag binders can be used, based upon known 
molecular interactions well described in the literature. For example, where a tag has a natural 
binder, for example, biotin, protein A, or protein G, it can be used in conjunction with 
appropriate tag binders (avidin, streptavidin, neutravidin, the Fc region of an immunoglobulin,
poly-His, etc.). Antibodies to molecules with natural binders such as biotin are also widely
available and appropriate tag binders (see, SIGMA Immunochemicals 1998 catalogue 
SIGMA, St. Louis MO). 

[220] Similarly, any haptenic or antigenic compound can be used in
combination with an appropriate antibody to form a tag/tag binder pair. Thousands of
specific antibodies are commercially available and many additional antibodies are described
in the literature. For example, in one common configuration, the tag is a first antibody and
the tag binder is a second antibody that recognizes the first antibody. In addition to
antibody-antigen interactions, receptor-ligand interactions are also appropriate as tag and tag-binder
pairs, such as agonists and antagonists of cell membrane receptors (e.g., cell receptor-ligand
interactions such as transferrin, c-kit, viral receptor ligands, cytokine receptors, chemokine
receptors, interleukin receptors, immunoglobulin receptors and antibodies, the cadherin
family, the integrin family, the selectin family, and the like; see, e.g., Pigott & Power, The
Adhesion Molecule Facts Book I (1993)). Similarly, toxins and venoms, viral epitopes,
hormones (e.g., opiates, steroids, etc.), intracellular receptors (e.g., which mediate the effects
of various small ligands, including steroids, thyroid hormone, retinoids and vitamin D;
peptides), drugs, lectins, sugars, nucleic acids (both linear and cyclic polymer configurations),
oligosaccharides, proteins, phospholipids and antibodies can all interact with various cell
receptors.

[221] Synthetic polymers, such as polyurethanes, polyesters, polycarbonates,
polyureas, polyamides, polyethyleneimines, polyarylene sulfides, polysiloxanes, polyimides,
and polyacetates can also form an appropriate tag or tag binder. Many other tag/tag binder
pairs are also useful in assay systems described herein, as would be apparent to one of skill
upon review of this disclosure.

[222] Common linkers such as peptides, polyethers, and the like can also
serve as tags, and include polypeptide sequences, such as poly-gly sequences of between
about 5 and 200 amino acids. Such flexible linkers are known to those of skill in the art. For
example, poly(ethylene glycol) linkers are available from Shearwater Polymers, Inc.,
Huntsville, Alabama. These linkers optionally have amide linkages, sulfhydryl linkages, or
heterofunctional linkages.

[223] Tag binders are fixed to solid substrates using any of a variety of
methods currently available. Solid substrates are commonly derivatized or functionalized by
exposing all or a portion of the substrate to a chemical reagent that fixes a chemical group to
the surface that is reactive with a portion of the tag binder. For example, groups that are
suitable for attachment to a longer chain portion would include amines, hydroxyl, thiol, and
carboxyl groups. Aminoalkylsilanes and hydroxyalkylsilanes can be used to functionalize a
variety of surfaces, such as glass surfaces. The construction of such solid phase biopolymer
arrays is well described in the literature (see, e.g., Merrifield, J. Am. Chem. Soc. 85:2149-
2154 (1963) (describing solid phase synthesis of, e.g., peptides); Geysen et al., J. Immun.
Meth. 102:259-274 (1987) (describing synthesis of solid phase components on pins); Frank
and Doring, Tetrahedron 44:6031-6040 (1988) (describing synthesis of various peptide
sequences on cellulose disks); Fodor et al., Science, 251:767-777 (1991); Sheldon et al.,
Clinical Chemistry 39(4):718-719 (1993); and Kozal et al., Nature Medicine 2(7):753-759
(1996) (all describing arrays of biopolymers fixed to solid substrates)). Non-chemical
approaches for fixing tag binders to substrates include other common methods, such as heat,
cross-linking by UV radiation, and the like.
5 [224J The invention provides in vitro assays for identifying, in a high 

throughput format, compounds that can modulate the expression or activity of a polypeptide 
of the invention. Control reactions that measure activity of a polypeptide of the invention in 
a cell in a reaction that does not include a potential modulator are optional, as the assays are 
highly uniform. Such optional control reactions are appropriate and increase the reliability of 

10 the assay. Accordingly, in some embodiments, the methods of the invention include such a 
control reaction. For each of the assay formats described, u no modulator" control reactions 
that do not include a modulator provide a background level of binding activity. 

[225] In some assays it will be desirable to have positive controls. At least
two types of positive controls are appropriate. First, a known activator of a polypeptide or a
polynucleotide of the invention can be incubated with one sample of the assay, and the
resulting increase in signal resulting from an increased expression level or activity of a
polypeptide or a polynucleotide of the invention is determined according to the methods
herein. Second, a known inhibitor of a polypeptide or a polynucleotide of the invention can
be added, and the resulting decrease in signal for the expression or activity of a polypeptide
or a polynucleotide of the invention can be similarly detected. It will be appreciated that
modulators can also be combined with activators or inhibitors to find modulators that inhibit
the increase or decrease that is otherwise caused by the presence of the known modulator of a
polypeptide or a polynucleotide of the invention.
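
The role of the "no modulator" control as a background reference can be illustrated with a short sketch. The function names, the optional blank reading and the 20% cut-off are assumptions introduced here for illustration only; the specification does not prescribe any particular normalization or threshold.

```python
def percent_of_control(sample_signal, no_modulator_signal, blank_signal=0.0):
    """Express a test-well signal as a percentage of the no-modulator control.

    blank_signal is a hypothetical instrument blank; it defaults to zero.
    """
    window = no_modulator_signal - blank_signal
    if window <= 0:
        raise ValueError("no-modulator control must exceed the blank signal")
    return 100.0 * (sample_signal - blank_signal) / window


def classify(sample_signal, no_modulator_signal, threshold_pct=20.0):
    """Label a well relative to the no-modulator control.

    threshold_pct is an assumed cut-off, not a value taken from the text.
    """
    pct = percent_of_control(sample_signal, no_modulator_signal)
    if pct >= 100.0 + threshold_pct:
        return "candidate activator"
    if pct <= 100.0 - threshold_pct:
        return "candidate inhibitor"
    return "no effect detected"
```

A known activator or inhibitor run through the same function would be expected to fall in the corresponding category, which is one way a positive control could be checked.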

VII. COMPOSITIONS, KITS AND INTEGRATED SYSTEMS 

[226] The invention provides compositions, kits and integrated systems for
practicing the assays described herein using nucleic acids or polypeptides of the invention,
antibodies, etc.

[227] The invention provides assay compositions for use in solid phase
assays; such compositions can include, for example, one or more nucleic acids encoding a
polypeptide of the invention immobilized on a solid support, and a labeling reagent. In each
case, the assay compositions can also include additional reagents that are desirable for
hybridization. Modulators of expression or activity of a polypeptide of the invention can also
be included in the assay compositions.

[228] The invention also provides kits for carrying out the assays of the
invention. The kits typically include a probe that comprises (1) an antibody that specifically 
binds to a polypeptide of the invention or (2) a polynucleotide sequence encoding at least a 
fragment of such polypeptides, and a label for detecting the presence of the probe. The kits 
may include at least one polynucleotide sequence encoding a polypeptide of the invention. 
Kits can include any of the compositions noted above, and optionally further include 
additional components such as instructions to practice a high-throughput method of assaying 
for an effect on expression of the genes encoding a polypeptide of the invention, or on 
activity of a polypeptide of the invention, one or more containers or compartments (e.g., to 
hold the probe, labels, or the like), a control modulator of the expression or activity of a 
polypeptide of the invention, a robotic armature for mixing kit components or the like. 

[229] The invention also provides integrated systems for high-throughput
screening of potential modulators for an effect on the expression or activity of a polypeptide 
of the invention. The systems can include a robotic armature which transfers fluid from a 
source to a destination, a controller which controls the robotic armature, a label detector, a 
data storage unit which records label detection, and an assay component such as a microtiter 
dish comprising a well having a reaction mixture or a substrate comprising a fixed nucleic 
acid or immobilization moiety. 

[230] A number of robotic fluid transfer systems are available, or can easily 
be made from existing components. For example, a Zymate XP (Zymark Corporation; 
Hopkinton, MA) automated robot using a Microlab 2200 (Hamilton; Reno, NV) pipetting 
station can be used to transfer parallel samples to 96-well microtiter plates to set up several
parallel simultaneous binding assays. 
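
As a rough illustration of how parallel samples might be laid out on a 96-well plate before a robot transfers them, the following sketch assigns samples to wells in row-major order. The 8 x 12 layout with rows A-H and columns 1-12 is the conventional plate geometry; the function names and the fill order are assumptions made here, not features of any particular robot's control software.

```python
import string

ROWS = string.ascii_uppercase[:8]   # rows A through H
COLUMNS = range(1, 13)              # columns 1 through 12


def well_ids():
    """Return the 96 well identifiers in row-major order (A1, A2, ..., H12)."""
    return [f"{row}{col}" for row in ROWS for col in COLUMNS]


def assign_samples(sample_names):
    """Map each sample to the next free well; hypothetical helper."""
    wells = well_ids()
    if len(sample_names) > len(wells):
        raise ValueError("more samples than wells on a single plate")
    return dict(zip(wells, sample_names))
```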

[231] Optical images viewed (and, optionally, recorded) by a camera or other 
recording device (e.g., a photodiode and data storage device) are optionally further processed 
in any of the embodiments herein, e.g., by digitizing the image and storing and analyzing the
image on a computer. A variety of commercially available peripheral equipment and 
software is available for digitizing, storing and analyzing a digitized video or digitized optical 
image. 

[232] One conventional system carries light from the specimen field to a
cooled charge-coupled device (CCD) camera, in common use in the art. A CCD camera
includes an array of picture elements (pixels). The light from the specimen is imaged on the
CCD. Particular pixels corresponding to regions of the specimen (e.g., individual
hybridization sites on an array of biological polymers) are sampled to obtain light intensity
readings for each position. Multiple pixels are processed in parallel to increase speed. The
apparatus and methods of the invention are easily used for viewing any sample, e.g., by
fluorescent or dark field microscopic techniques.
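
The pixel sampling described above can be sketched as follows, assuming the digitized image is available as a list of rows of intensity values and each hybridization site is a rectangular block of pixels. The site names, the rectangular regions and the use of a simple mean are illustrative assumptions; the sketch processes sites one at a time rather than in parallel.

```python
def mean_intensity(image, top, left, height, width):
    """Average the pixel values inside one rectangular region of the image."""
    pixels = [
        image[r][c]
        for r in range(top, top + height)
        for c in range(left, left + width)
    ]
    return sum(pixels) / len(pixels)


def read_sites(image, sites):
    """Return one intensity reading per site.

    sites maps a hypothetical site name to (top, left, height, width).
    """
    return {name: mean_intensity(image, *box) for name, box in sites.items()}
```

For example, a 4 x 4 image with two 2 x 2 sites yields one averaged reading per site, which could then be stored and compared across positions.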

VIII. ADMINISTRATION AND PHARMACEUTICAL COMPOSITIONS

[233] Modulators of the polypeptides of the invention (e.g., antagonists or
agonists including polypeptides of the invention, fragments thereof, or fusions comprising the
polypeptides or fragments which have antagonist activity or an additive effect on overall
polypeptide activity) can be administered directly to the mammalian subject (typically in
need thereof due to a pre-diabetic, diabetic or obese condition) for modulation of activity of a
polypeptide of the invention in vivo. Administration is by any of the routes normally used for
introducing a modulator compound into ultimate contact with the tissue to be treated and is
well known to those of skill in the art. Although more than one route can be used to
administer a particular composition, a particular route can often provide a more immediate
and more effective reaction than another route.

[234] The pharmaceutical compositions of the invention may comprise a
pharmaceutically acceptable carrier. Pharmaceutically acceptable carriers are determined in
part by the particular composition being administered, as well as by the particular method
used to administer the composition. Accordingly, there are a wide variety of suitable
formulations of pharmaceutical compositions of the present invention (see, e.g., Remington's
Pharmaceutical Sciences, 17th ed. 1985).

[235] The modulators (e.g., agonists or antagonists) of the expression or
activity of a polypeptide of the invention, alone or in combination with other suitable
components, can be prepared for injection or for use in a pump device. Pump devices (also
known as "insulin pumps") are commonly used to administer insulin to patients and therefore
can be easily adapted to include compositions of the present invention. Manufacturers of
insulin pumps include Animas, Disetronic and MiniMed.

[236] The modulators (e.g., agonists or antagonists) of the expression or
activity of a polypeptide of the invention, alone or in combination with other suitable
components, can be made into aerosol formulations (i.e., they can be "nebulized") to be
administered via inhalation. Aerosol formulations can be placed into pressurized acceptable
propellants, such as dichlorodifluoromethane, propane, nitrogen, and the like.

[237] Formulations suitable for administration include aqueous and non-aqueous
solutions, isotonic sterile solutions, which can contain antioxidants, buffers,
bacteriostats, and solutes that render the formulation isotonic, and aqueous and non-aqueous
sterile suspensions that can include suspending agents, solubilizers, thickening agents,
stabilizers, and preservatives. In the practice of this invention, compositions can be
administered, for example, orally, nasally, topically, intravenously, intraperitoneally, or
intrathecally. The formulations of compounds can be presented in unit-dose or multi-dose
sealed containers, such as ampoules and vials. Solutions and suspensions can be prepared
from sterile powders, granules, and tablets of the kind previously described. The modulators
can also be administered as part of a prepared food or drug.

[238] The dose administered to a patient, in the context of the present
invention, should be sufficient to induce a beneficial response in the subject over time. The
optimal dose level for any patient will depend on a variety of factors including the efficacy of
the specific modulator employed, the age, body weight, physical activity, and diet of the
patient, on a possible combination with other drugs, and on the severity of the case of
diabetes. It is recommended that the daily dosage of the modulator be determined for each
individual patient by those skilled in the art in a similar way as for known insulin
compositions. The size of the dose also will be determined by the existence, nature, and
extent of any adverse side-effects that accompany the administration of a particular
compound or vector in a particular subject.

[239] In determining the effective amount of the modulator to be
administered, a physician may evaluate circulating plasma levels of the modulator, modulator
toxicity, and the production of anti-modulator antibodies. In general, the dose equivalent of a
modulator is from about 1 ng/kg to 10 mg/kg for a typical subject.
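
The arithmetic behind the stated range can be made concrete with a small sketch that converts the per-kilogram bounds into total amounts for a given body weight. It only illustrates the unit conversion; the function name and the 70 kg example weight are assumptions, and the actual dose for a patient is determined as described in the surrounding paragraphs.

```python
NG_PER_MG = 1_000_000  # nanograms in one milligram


def dose_range_mg(body_weight_kg, low_ng_per_kg=1.0, high_mg_per_kg=10.0):
    """Return (low, high) total dose in mg for the stated per-kg range.

    Defaults follow the range quoted in the text: about 1 ng/kg to 10 mg/kg.
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low_mg = low_ng_per_kg * body_weight_kg / NG_PER_MG
    high_mg = high_mg_per_kg * body_weight_kg
    return low_mg, high_mg
```

For a hypothetical 70 kg subject, the range spans from 70 ng (0.00007 mg) to 700 mg, which shows how wide the quoted interval is.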

[240] For administration, modulators of the present invention can be
administered at a rate determined by the LD-50 of the modulator, and the side-effects of the
modulator at various concentrations, as applied to the mass and overall health of the subject.
Administration can be accomplished via single or divided doses.

[241] The compounds of the present invention can also be used effectively in
combination with one or more additional active agents depending on the desired target
therapy (see, e.g., Turner, N. et al. Prog. Drug Res. (1998) 51: 33-94; Haffner, S. Diabetes
Care (1998) 21: 160-178; and DeFronzo, R. et al. (eds.), Diabetes Reviews (1997) Vol. 5 No.
4). A number of studies have investigated the benefits of combination therapies with oral
agents (see, e.g., Mahler, R., J. Clin. Endocrinol. Metab. (1999) 84: 1165-71; United
Kingdom Prospective Diabetes Study Group: UKPDS 28, Diabetes Care (1998) 21: 87-92;
Bardin, C. W. (ed.), Current Therapy In Endocrinology And Metabolism, 6th Edition
(Mosby-Year Book, Inc., St. Louis, MO 1997); Chiasson, J. et al., Ann. Intern. Med. (1994) 121:
928-935; Coniff, R. et al., Clin. Ther. (1997) 19: 16-26; Coniff, R. et al., Am. J. Med. (1995)
98: 443-451; Iwamoto, Y. et al., Diabet. Med. (1996) 13: 365-370; and Kwiterovich, P. Am. J.
Cardiol. (1998) 82(12A): 3U-17U). These studies indicate that modulation of diabetes,
among other diseases, can be further improved by the addition of a second agent to the
therapeutic regimen. Combination therapy includes administration of a single pharmaceutical
dosage formulation that contains a modulator of the invention and one or more additional
active agents, as well as administration of a modulator and each active agent in its own
separate pharmaceutical dosage formulation. For example, a modulator and a
thiazolidinedione can be administered to the human subject together in a single oral dosage
composition, such as a tablet or capsule, or each agent can be administered in separate oral
dosage formulations. Where separate dosage formulations are used, a modulator and one or
more additional active agents can be administered at essentially the same time (i.e.,
concurrently), or at separately staggered times (i.e., sequentially). Combination therapy is
understood to include all these regimens.

[242] One example of combination therapy can be seen in treating pre-diabetic
individuals (e.g., to prevent progression into type 2 diabetes) or diabetic individuals
(or treating diabetes and its related symptoms, complications, and disorders), wherein the
modulators can be effectively used in combination with, for example, sulfonylureas (such as
chlorpropamide, tolbutamide, acetohexamide, tolazamide, glyburide, gliclazide, glynase,
glimepiride, and glipizide); biguanides (such as metformin); a PPAR beta delta agonist; a
ligand or agonist of PPAR gamma such as thiazolidinediones (such as ciglitazone,
pioglitazone (see, e.g., U.S. Patent No. 6,218,409), troglitazone, and rosiglitazone (see, e.g.,
U.S. Patent No. 5,859,037)); PPAR alpha agonists such as clofibrate, gemfibrozil, fenofibrate,
ciprofibrate, and bezafibrate; dehydroepiandrosterone (also referred to as DHEA or its
conjugated sulphate ester, DHEA-SO4); antiglucocorticoids; TNFα inhibitors; α-glucosidase
inhibitors (such as acarbose, miglitol, and voglibose); amylin and amylin derivatives (such as
pramlintide (see, also, U.S. Patent Nos. 5,902,726; 5,124,314; 5,175,145 and 6,143,718));
insulin secretagogues (such as repaglinide, gliquidone, and nateglinide (see, also, U.S. Patent
Nos. 6,251,856; 6,251,865; 6,221,633; 6,174,856)), and insulin.

[243] The modulators of the invention can also be combined with anti-obesity
drugs (e.g., Xenical (Orlistat), Meridia (Sibutramine) or Adipex-P (Phentermine)) or
appetite-suppressing drugs.

IX. GENE THERAPY

[244] Conventional viral and non-viral based gene transfer methods can be
used to introduce nucleic acids encoding engineered amino acid sequences comprising the
polypeptides of the invention in mammalian cells or target tissues. Such methods can be used
to administer nucleic acids encoding amino acid sequences comprising polypeptides of the
invention to cells in vitro. In some embodiments, the nucleic acids encoding amino acid
sequences comprising polypeptides of the invention are administered for in vivo or ex vivo
gene therapy uses. Non-viral vector delivery systems include DNA plasmids, naked nucleic
acid, and nucleic acid complexed with a delivery vehicle such as a liposome. Viral vector
delivery systems include DNA and RNA viruses, which have either episomal or integrated
genomes after delivery to the cell. For a review of gene therapy procedures, see Anderson,
Science 256:808-813 (1992); Nabel & Felgner, TIBTECH 11:211-217 (1993); Mitani &
Caskey, TIBTECH 11:162-166 (1993); Dillon, TIBTECH 11:167-175 (1993); Miller, Nature
357:455-460 (1992); Van Brunt, Biotechnology 6(10):1149-1154 (1988); Vigne, Restorative
Neurology and Neuroscience 8:35-36 (1995); Kremer & Perricaudet, British Medical Bulletin
51(1):31-44 (1995); Haddada et al., in Current Topics in Microbiology and Immunology
Doerfler and Böhm (eds) (1995); and Yu et al., Gene Therapy 1:13-26 (1994).

[245] Methods of non-viral delivery of nucleic acids encoding engineered
polypeptides of the invention include lipofection, microinjection, biolistics, virosomes,
liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA,
artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in, e.g., US
5,049,386, US 4,946,787; and US 4,897,355, and lipofection reagents are sold commercially
(e.g., Transfectam™ and Lipofectin™). Cationic and neutral lipids that are suitable for
efficient receptor-recognition lipofection of polynucleotides include those of Felgner, WO
91/17424, WO 91/16024. Delivery can be to cells (ex vivo administration) or target tissues
(in vivo administration).

[246] The preparation of lipid:nucleic acid complexes, including targeted
liposomes such as immunolipid complexes, is well known to one of skill in the art (see, e.g.,
Crystal, Science 270:404-410 (1995); Blaese et al., Cancer Gene Ther. 2:291-297 (1995);
Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al., Bioconjugate Chem. 5:647-654
(1994); Gao et al., Gene Therapy 2:710-722 (1995); Ahmad et al., Cancer Res. 52:4817-4820
(1992); U.S. Pat. Nos. 4,186,183, 4,217,344, 4,235,871, 4,261,975, 4,485,054,
4,501,728, 4,774,085, 4,837,028, and 4,946,787).

[247] The use of RNA or DNA viral based systems for the delivery of
nucleic acids encoding engineered polypeptides of the invention takes advantage of highly
evolved processes for targeting a virus to specific cells in the body and trafficking the viral
payload to the nucleus. Viral vectors can be administered directly to patients (in vivo) or they
can be used to treat cells in vitro and the modified cells are administered to patients (ex vivo).
Conventional viral based systems for the delivery of polypeptides of the invention could
include retroviral, lentivirus, adenoviral, adeno-associated and herpes simplex virus vectors
for gene transfer. Viral vectors are currently the most efficient and versatile method of gene
transfer in target cells and tissues. Integration in the host genome is possible with the
retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in
long term expression of the inserted transgene. Additionally, high transduction efficiencies
have been observed in many different cell types and target tissues.

[248] The tropism of a retrovirus can be altered by incorporating foreign
envelope proteins, expanding the potential target population of target cells. Lentiviral vectors
are retroviral vectors that are able to transduce or infect non-dividing cells and typically
produce high viral titers. Selection of a retroviral gene transfer system would therefore
depend on the target tissue. Retroviral vectors are comprised of cis-acting long terminal
repeats with packaging capacity for up to 6-10 kb of foreign sequence. The minimum
cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used
to integrate the therapeutic gene into the target cell to provide permanent transgene
expression. Widely used retroviral vectors include those based upon murine leukemia virus
(MuLV), gibbon ape leukemia virus (GaLV), simian immunodeficiency virus (SIV), human
immunodeficiency virus (HIV), and combinations thereof (see, e.g., Buchscher et al., J.
Virol. 66:2731-2739 (1992); Johann et al., J. Virol. 66:1635-1640 (1992); Sommerfelt et al.,
Virol. 176:58-59 (1990); Wilson et al., J. Virol. 63:2374-2378 (1989); Miller et al., J. Virol.
65:2220-2224 (1991); PCT/US94/05700).

[249] In applications where transient expression of the polypeptides of the
invention is preferred, adenoviral based systems are typically used. Adenoviral based vectors
are capable of very high transduction efficiency in many cell types and do not require cell
division. With such vectors, high titer and levels of expression have been obtained. This
vector can be produced in large quantities in a relatively simple system. Adeno-associated
virus ("AAV") vectors are also used to transduce cells with target nucleic acids, e.g., in the in
vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy
procedures (see, e.g., West et al., Virology 160:38-47 (1987); U.S. Patent No. 4,797,368; WO
93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, J. Clin. Invest.
94:1351 (1994)). Construction of recombinant AAV vectors is described in a number of
publications, including U.S. Pat. No. 5,173,414; Tratschin et al., Mol. Cell. Biol. 5:3251-3260
(1985); Tratschin, et al., Mol. Cell. Biol. 4:2072-2081 (1984); Hermonat & Muzyczka, PNAS
81:6466-6470 (1984); and Samulski et al., J. Virol. 63:03822-3828 (1989).

[250] pLASN and MFG-S are examples of retroviral vectors that have been
used in clinical trials (Dunbar et al., Blood 85:3048-305 (1995); Kohn et al., Nat. Med.
1:1017-102 (1995); Malech et al., PNAS 94:22 12133-12138 (1997)). PA317/pLASN was
the first therapeutic vector used in a gene therapy trial (Blaese et al., Science 270:475-480
(1995)). Transduction efficiencies of 50% or greater have been observed for MFG-S
packaged vectors (Ellem et al., Immunol Immunother. 44(1):10-20 (1997); Dranoff et al.,
Hum. Gene Ther. 1:111-2 (1997)).

[251] Recombinant adeno-associated virus vectors (rAAV) are a promising
alternative gene delivery system based on the defective and nonpathogenic parvovirus
adeno-associated type 2 virus. All vectors are derived from a plasmid that retains only the
AAV 145 bp inverted terminal repeats flanking the transgene expression cassette. Efficient
gene transfer and stable transgene delivery due to integration into the genomes of the
transduced cell are key features for this vector system (Wagner et al., Lancet 351:9117 1702-3
(1998), Kearns et al., Gene Ther. 9:748-55 (1996)).

[252] Replication-deficient recombinant adenoviral vectors (Ad) can be
engineered such that a transgene replaces the Ad E1a, E1b, and E3 genes; subsequently the
replication-defective vector is propagated in human 293 cells that supply deleted gene function
in trans. Ad vectors can transduce multiple types of tissues in vivo, including nondividing,
differentiated cells such as those found in the liver, kidney and muscle system tissues.
Conventional Ad vectors have a large carrying capacity. An example of the use of an Ad
vector in a clinical trial involved polynucleotide therapy for antitumor immunization with
intramuscular injection (Sterman et al., Hum. Gene Ther. 7:1083-9 (1998)). Additional
examples of the use of adenovirus vectors for gene transfer in clinical trials include
Rosenecker et al., Infection 24:1 5-10 (1996); Sterman et al., Hum. Gene Ther. 9:7 1083-1089
(1998); Welsh et al., Hum. Gene Ther. 2:205-18 (1995); Alvarez et al., Hum. Gene Ther.
5:597-613 (1997); Topf et al., Gene Ther. 5:507-513 (1998); Sterman et al., Hum. Gene Ther.
7:1083-1089 (1998).

[253] Packaging cells are used to form virus particles that are capable of
infecting a host cell. Such cells include 293 cells, which package adenovirus, and ψ2 cells or
PA317 cells, which package retrovirus. Viral vectors used in gene therapy are usually
generated by a producer cell line that packages a nucleic acid vector into a viral particle. The
vectors typically contain the minimal viral sequences required for packaging and subsequent
integration into a host, other viral sequences being replaced by an expression cassette for the
protein to be expressed. The missing viral functions are supplied in trans by the packaging
cell line. For example, AAV vectors used in gene therapy typically only possess ITR
sequences from the AAV genome which are required for packaging and integration into the
host genome. Viral DNA is packaged in a cell line, which contains a helper plasmid
encoding the other AAV genes, namely rep and cap, but lacking ITR sequences. The cell
line is also infected with adenovirus as a helper. The helper virus promotes replication of the
AAV vector and expression of AAV genes from the helper plasmid. The helper plasmid is
not packaged in significant amounts due to a lack of ITR sequences. Contamination with
adenovirus can be reduced by, e.g., heat treatment to which adenovirus is more sensitive than
AAV.

[254] In many gene therapy applications, it is desirable that the gene therapy
vector be delivered with a high degree of specificity to a particular tissue type. A viral vector
is typically modified to have specificity for a given cell type by expressing a ligand as a
fusion protein with a viral coat protein on the virus's outer surface. The ligand is chosen to
have affinity for a receptor known to be present on the cell type of interest. For example,
Han et al., PNAS 92:9747-9751 (1995), reported that Moloney murine leukemia virus can be
modified to express human heregulin fused to gp70, and the recombinant virus infects certain
human breast cancer cells expressing human epidermal growth factor receptor. This principle
can be extended to other pairs of virus expressing a ligand fusion protein and target cell
expressing a receptor. For example, filamentous phage can be engineered to display antibody
fragments (e.g., Fab or Fv) having specific binding affinity for virtually any chosen cellular
receptor. Although the above description applies primarily to viral vectors, the same
principles can be applied to nonviral vectors. Such vectors can be engineered to contain
specific uptake sequences thought to favor uptake by specific target cells.

[255] Gene therapy vectors can be delivered in vivo by administration to an
individual patient, typically by systemic administration (e.g., intravenous, intraperitoneal,
intramuscular, subdermal, or intracranial infusion) or topical application, as described below.
Alternatively, vectors can be delivered to cells ex vivo, such as cells explanted from an
individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy) or universal
donor hematopoietic stem cells, followed by reimplantation of the cells into a patient, usually
after selection for cells which have incorporated the vector.

[256] Ex vivo cell transfection for diagnostics, research, or for gene therapy
(e.g., via re-infusion of the transfected cells into the host organism) is well known to those of
skill in the art. In some embodiments, cells are isolated from the subject organism,
transfected with a nucleic acid (gene or cDNA) encoding a polypeptide of the invention, and
re-infused back into the subject organism (e.g., patient). Various cell types suitable for ex
vivo transfection are well known to those of skill in the art (see, e.g., Freshney et al., Culture
of Animal Cells, A Manual of Basic Technique (3rd ed. 1994) and the references cited
therein for a discussion of how to isolate and culture cells from patients).

[257] In one embodiment, stem cells are used in ex vivo procedures for cell
transfection and gene therapy. The advantage to using stem cells is that they can be
differentiated into other cell types in vitro, or can be introduced into a mammal (such as the
donor of the cells) where they will engraft in the bone marrow. Methods for differentiating
CD34+ cells in vitro into clinically important immune cell types using cytokines such as
GM-CSF, IFN-γ and TNF-α are known (see Inaba et al., J. Exp. Med. 176:1693-1702 (1992)).

[258] Stem cells are isolated for transduction and differentiation using
known methods. For example, stem cells are isolated from bone marrow cells by panning the
bone marrow cells with antibodies which bind unwanted cells, such as CD4+ and CD8+ (T
cells), CD45+ (panB cells), GR-1 (granulocytes), and Iad (differentiated antigen presenting
cells) (see Inaba et al., J. Exp. Med. 176:1693-1702 (1992)).

[259] Vectors (e.g., retroviruses, adenoviruses, liposomes, etc.) containing
therapeutic nucleic acids can be also administered directly to the organism for transduction of
cells in vivo. Alternatively, naked DNA can be administered. Administration is by any of the
routes normally used for introducing a molecule into ultimate contact with blood or tissue
cells. Suitable methods of administering such nucleic acids are available and well known to
those of skill in the art, and, although more than one route can be used to administer a
particular composition, a particular route can often provide a more immediate and more
effective reaction than another route.

[260] Pharmaceutically acceptable carriers are determined in part by the
particular composition being administered, as well as by the particular method used to
administer the composition. Accordingly, there is a wide variety of suitable formulations of
pharmaceutical compositions of the present invention, as described below (see, e.g.,
Remington's Pharmaceutical Sciences, 17th ed., 1989).

X. DIAGNOSIS OF OBESITY AND/OR DIABETES

[261] The present invention also provides methods of diagnosing diabetes or
obesity, or a predisposition to at least some of the pathologies of diabetes and/or obesity.
Diagnosis can involve determination of a genotype of an individual (e.g., with SNPs) and
comparison of the genotype with alleles known to have an association with the occurrence of
obesity and/or diabetes. Alternatively, diagnosis also involves determining the level of a
polypeptide or polynucleotide of the invention in a patient and then comparing the level to a
baseline or range. Typically, the baseline value is representative of a polypeptide or
polynucleotide of the invention in a healthy (e.g., lean) person.

[262] As discussed above, variation of levels (e.g., low or high levels) of a
polypeptide or polynucleotide of the invention compared to the baseline range indicates that
the patient is either obese, at risk for becoming obese, diabetic or at risk of developing at least
some of the pathologies of diabetes (e.g., pre-diabetic). The level of a polypeptide in a lean
individual can be a reading from a single individual, but is typically a statistically relevant
average from a group of lean individuals. The level of a polypeptide in a lean individual can
be represented by a value, for example in a computer program.
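
A minimal sketch of how a baseline range might be held as a value in a computer program is given below. It assumes the baseline is summarized as the mean of readings from a group of lean individuals plus or minus a multiple of the sample standard deviation; the two-standard-deviation default and the function names are assumptions made here, since the text does not define what counts as a statistically relevant average or range.

```python
from statistics import mean, stdev


def baseline_range(lean_levels, n_sd=2.0):
    """Return (low, high) bounds around the mean of the lean-group readings.

    n_sd is an assumed width in sample standard deviations.
    """
    if len(lean_levels) < 2:
        raise ValueError("need at least two lean readings to estimate spread")
    centre = mean(lean_levels)
    spread = n_sd * stdev(lean_levels)
    return centre - spread, centre + spread


def outside_baseline(level, lean_levels, n_sd=2.0):
    """True when a patient's level falls below or above the baseline range."""
    low, high = baseline_range(lean_levels, n_sd)
    return level < low or level > high
```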

[263] In some embodiments, the level of polypeptide or polynucleotide of
the invention is measured by taking a blood, urine or tissue sample from a patient and
measuring the amount of a polypeptide or polynucleotide of the invention in the sample using
any number of detection methods, such as those discussed herein. For instance, fasting and
fed blood or urine levels can be tested.

[264] In some embodiments, the baseline level and the level in a lean sample
from an individual, or at least two samples from the same individual, differ by at least about
5%, 10%, 20%, 50%, 75%, 100%, 150%, 200%, 300%, 400%, 500%, 1000% or more. In
some embodiments, the sample from the individual is greater by at least one of the
above-listed percentages relative to the baseline level. In some embodiments, the sample from the
individual is lower by at least one of the above-listed percentages relative to the baseline
level.
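
The percentage comparison can be written out as a short calculation. The sketch below computes the signed percent difference of a sample level relative to a baseline level and reports the largest of the listed percentages that the magnitude of the difference reaches. The function names are hypothetical, and a positive, non-zero baseline is assumed.

```python
# Percentages listed in the text, in ascending order.
THRESHOLDS = (5, 10, 20, 50, 75, 100, 150, 200, 300, 400, 500, 1000)


def percent_difference(sample_level, baseline_level):
    """Signed percent difference of the sample relative to the baseline."""
    if baseline_level <= 0:
        raise ValueError("baseline level must be positive")
    return 100.0 * (sample_level - baseline_level) / baseline_level


def largest_threshold_met(sample_level, baseline_level):
    """Largest listed percentage reached by the difference, or None."""
    magnitude = abs(percent_difference(sample_level, baseline_level))
    met = [t for t in THRESHOLDS if magnitude >= t]
    return max(met) if met else None
```

A sample at 15 units against a baseline of 10 units is greater by 50%, whereas a sample at 5 units is lower by 50%; the sign of the first function distinguishes the two cases.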

[265] In some embodiments, the level of a polypeptide or polynucleotide of
the invention is used to monitor the effectiveness of either anti-obesity therapies such as
orlistat or sibutramine, or antidiabetic therapies such as thiazolidinediones, metformin,
sulfonylureas and other standard therapies. In some embodiments the activity or expression
of a polypeptide or polynucleotide of the invention will be measured prior to and after
treatment of an obese patient with anti-obesity therapies, or of diabetic or pre-diabetic patients
with antidiabetic therapies, as a surrogate marker of clinical effectiveness. For example, a
greater reduction in expression or activity of a polypeptide of the invention indicates
greater effectiveness.

[266] Glucose/insulin tolerance tests can also be used to detect the effect of
glucose levels on levels of a polypeptide or polynucleotide of the invention. In glucose
tolerance tests, the patient's ability to tolerate a standard oral glucose load is evaluated by
assessing serum and urine specimens for glucose levels. Blood samples are taken before the
glucose is ingested, glucose is given by mouth, and blood or urine glucose levels are tested at
set intervals after glucose ingestion. Similarly, meal tolerance tests can also be used to detect
the effect of insulin or food, respectively, on levels of a polypeptide or polynucleotide of the
invention.

[267] Body weight or other indicators of obesity can also be used to detect
the effect of modulating the levels of a polypeptide or polynucleotide of the invention.
Measurement of a subject's response can be evaluated by assessing serum for altered levels of
obesity-associated gene products, e.g., leptin, TNF alpha or IL-6.

[268] All publications and patent applications cited in this specification are
herein incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 

[269] Although the foregoing invention has been described in some detail by
way of illustration and example for purposes of clarity of understanding, it will be readily
apparent to one of ordinary skill in the art in light of the teachings of this invention that
certain changes and modifications may be made thereto without departing from the spirit or
scope of the appended claims.



EXAMPLES

[270] The following examples are offered to illustrate, but not to limit the
claimed invention. 

[271] In both obese insulin-resistant individuals and type II diabetics, peripheral tissues,
especially muscle and fat, are known to have an impaired ability to respond to insulin and
hence to take up glucose. This defect in glucose metabolism is usually compensated for by
increased secretion of insulin from the pancreas, thereby maintaining normal glucose levels.
The majority of glucose disposal occurs in the muscle. A number of obese insulin-resistant
patients will progress to overt diabetes with time. The molecular defects underlying this
peripheral insulin resistance in both the obese and type II diabetics are not well defined.
Genes in muscle or fat whose expression is altered in either or both the obese or type II
diabetics when compared to lean individuals can be causative genes for obesity, insulin
resistance and/or diabetes and are able to predict the transition to diabetes. Modulators of
such genes have the ability to reverse obesity and insulin resistance and to restore normal insulin
sensitivity, thereby improving whole body glucose homeostasis including, for example, insulin
secretion. Modulators of such genes also have the ability to be used to pre-empt the
transition from obesity-induced insulin resistance to diabetes. Modulators of such genes also
have the ability to be used to reverse metabolic obesity-related diseases such as
cardiovascular disease, hypertension or obesity-related cancer.

[272] The molecular mechanism by which thiazolidinediones (TZDs) cause
an increase in peripheral insulin sensitivity was studied. Genes in muscle or fat whose
expression is altered by TZDs may lie on a pathway leading from TZD treatment to increased
insulin sensitivity. Modulators of such genes can elicit the same effect as TZD treatment.
Moreover, such modulators can lack some of the side effects of TZD. Gene expression
profiling in cultures of primary human adipocytes treated with either pioglitazone or
rosiglitazone was used to identify genes important for TZD action and therefore treatment of
obesity, diabetes and/or insulin resistance.

[273] Gene expression profiling was performed on tissue samples
(subcutaneous adipose samples) obtained from lean, obese and diabetic individuals. Two
studies were performed. In the first study, samples were isolated from all individuals after a
5 hour hyperinsulinemic euglycemic clamp.

[274] In the second study, subcutaneous adipose samples were obtained from 
lean (BMI<25) and obese (BMI>30) individuals after an overnight fast.

[275] In a third study samples were obtained from human subcutaneous and
omental adipose tissues. Genes expressed only, or enriched, in fat can lie on pathways
involved in insulin sensitivity, appetite suppression or lipid metabolism in the adipose itself
or other peripheral tissues (e.g., muscle, liver, brain). For all tissue samples mRNA was
isolated from these adipose samples and converted to cRNA by standard procedures. The
gene expression profile for each individual was determined by hybridization of cRNA to
commercial and custom synthesized Affymetrix chips.

[276] Gene expression profile differences were calculated as follows. The
expression level of a particular gene is indicated by its 'signal intensity'. The raw data was
analyzed by a statistical test to remove 'outliers'. The mean 'signal intensity' was then
calculated from the signal intensities for all individuals in a particular treatment group.
Genes were determined to be changed in the first two studies by calculating the Student's t test
statistic between the two conditions and selecting those with a t test value less than or equal to 0.05. The
fold change was determined as the ratio of mean signal intensity in condition 2 to the mean
signal intensity in condition 1. In the first study three comparisons were undertaken: diabetics
(condition 1) versus leans (condition 2), obese (condition 1) versus lean (condition 2) and
diabetics (condition 1) versus obese (condition 2). The second study comparison is lean
(condition 1) versus obese (condition 2). The third comparison is the identification of fat-specific
or fat-enriched genes by comparing the expression profile of human subcutaneous and
omental adipose tissues to at least 12 other human adult tissues. Genes were determined to
meet the cut-off criteria when the mean signal intensity of the human adipose samples was
3 fold greater than the mean signal intensity of all the other human adult tissues profiled, or
when they were called present only in the adipose samples and absent in all others by the Affymetrix software
program.
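
The fold change and the 3 fold adipose enrichment cut-off described above can be illustrated with a short sketch. The function names and the signal intensities below are hypothetical, outlier removal and the present/absent calls of the chip software are not modeled, and treating "3 fold greater" as "at least 3 fold" is an assumption.

```python
from math import sqrt
from statistics import mean, stdev


def sem(values):
    """Standard error of the mean of a list of signal intensities."""
    return stdev(values) / sqrt(len(values))


def fold_change(condition_1, condition_2):
    """Ratio of mean signal intensity in condition 2 to that in condition 1."""
    return mean(condition_2) / mean(condition_1)


def is_fat_enriched(adipose, other_tissues, min_fold=3.0):
    """Cut-off: adipose mean at least min_fold times the mean of all other tissues."""
    return mean(adipose) >= min_fold * mean(other_tissues)


if __name__ == "__main__":
    lean = [17.0, 19.0, 20.0]   # hypothetical intensities, condition 1
    obese = [40.0, 44.0, 48.0]  # hypothetical intensities, condition 2
    print(round(fold_change(lean, obese), 2), round(sem(obese), 2))
    print(is_fat_enriched([40.0, 50.0], [10.0, 12.0]))
```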






ADLICAN 

[277] Probe set 209596 detects ADLICAN nucleic acid sequences.
Expression of ADLICAN transcripts was increased in diabetic compared to lean patients in 
the gene profiling experiment. 
B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.

[278] ADLICAN was also evaluated using real-time PCR. The results further 
show that ADLICAN is significantly over-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison    Expression Fold Change

0.115

"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[279] Probe set 209596 detects ADLICAN nucleic acid sequences.
Expression of ADLICAN transcripts was increased in obese compared to lean patients in the
gene profiling experiment.

"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.

[280] ADLICAN was also evaluated using real-time PCR. The results further
show that ADLICAN is significantly over-expressed in subcutaneous adipose from obese 
individuals when compared to subcutaneous adipose from lean individuals. 

"Fold Change" indicates the fold expression calculated as the ratio of the mean obese
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[281] Probe set 209596 detects ADLICAN nucleic acid sequences.
Expression of ADLICAN transcripts was decreased in pio compared to vehicle treated
cultures of primary human adipocytes in the gene profiling experiment.

B/C indicates sample is from Basal or Clamp; "Pre-Pio" and "Post-Pio" indicate sample was
taken before or after 24 hours of pioglitazone treatment; "Mean Expr" indicates mean
expression; "SEM" indicates standard error of mean; "n" indicates number of patient
samples; "Fold Change" indicates fold change of primary human adipocytes post-pio in
comparison to pre-pio samples.

[282] Probe set 209596 detects ADLICAN nucleic acid sequences.
Expression of ADLICAN transcripts was increased in adipose tissues compared to all other
human adult tissues in the gene profiling experiment.

"n" indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled.

[283] ADLICAN contains the following protein domains (designated with 
reference to SEQ ID NO:2): Atrophin-1 family (PF03154) at amino acids 1405 to 
2232; Leucine rich repeat N-terminal domain (PF01462) at amino acids 26 to 
54; Geminivirus AL2 protein (PF01440) at amino acids 1317 to 1428; Leucine Rich Repeat 
(PF00560) at amino acids 80 to 103, 128 to 151; and Immunoglobulin domain (PF00047) at 
amino acids 494 to 557, 592 to 653, 1868 to 1930, 1965 to 2027, 2062 to 2124, 2161 to 2223, 
2258 to 2326, 2361 to 2420, 2459 to 2520, 2557 to 2618, 2652 to 2713, 2748 to 2812. 
ADLICAN is a protein which contains many domains which mediate protein-protein binding. 

ALDH1A3 

[284] Probe set 203180 detects ALDH1A3 nucleic acid sequences.
Expression of ALDH1A3 transcripts was increased in diabetic compared to lean patients in
the gene profiling experiment.

"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.

[285] ALDH1A3 was also evaluated using real-time PCR. The results further
show that ALDH1A3 is significantly over-expressed in subcutaneous adipose from diabetic
individuals when compared to subcutaneous adipose from lean individuals.

"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[286] Probe set 203180 detects ALDH1A3 nucleic acid sequences.
Expression of ALDH1A3 transcripts was increased in obese compared to lean patients in the
gene profiling experiment.

B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.

[287] ALDH1A3 was also evaluated using real-time PCR. The results further
show that ALDH1A3 is significantly over-expressed in subcutaneous adipose from obese
individuals when compared to subcutaneous adipose from lean individuals.

Expression Fold Change

0.018

"Fold Change" indicates the fold expression calculated as the ratio of the mean obese
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[288] Probe set 203180 detects ALDH1A3 nucleic acid sequences.
Expression of ALDH1A3 transcripts was increased in obese compared to lean patients in the
gene profiling experiment.





       OBESE                     LEAN
B/C    Mean Expr   SEM    n     Mean Expr   SEM    n     Fold Change   Student's t test   Gene Name
B      42.76       7.99   5     18.65       1.47   4     2.29          0.038              ALDH1A3



B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression; 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of obese in comparison to lean patients. 
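
As a worked illustration of how the summary columns of the table above relate to one another, the sketch below recomputes the fold change and a t statistic from the reported means, SEMs and sample counts. The specification does not state which form of the t test was used; the unequal-variance (Welch) form here is an assumption, and the function name is hypothetical. No p-value is computed.

```python
from math import sqrt


def welch_t_from_summary(mean_1, sem_1, n_1, mean_2, sem_2, n_2):
    """Welch t statistic and degrees of freedom from group means, SEMs and counts."""
    var_1, var_2 = sem_1 ** 2, sem_2 ** 2  # squared standard errors
    t = (mean_1 - mean_2) / sqrt(var_1 + var_2)
    df = (var_1 + var_2) ** 2 / (var_1 ** 2 / (n_1 - 1) + var_2 ** 2 / (n_2 - 1))
    return t, df


if __name__ == "__main__":
    # Values taken from the ALDH1A3 obese versus lean row of the table.
    obese_mean, obese_sem, obese_n = 42.76, 7.99, 5
    lean_mean, lean_sem, lean_n = 18.65, 1.47, 4
    print(round(obese_mean / lean_mean, 2))  # fold change of obese relative to lean
    t, df = welch_t_from_summary(obese_mean, obese_sem, obese_n,
                                 lean_mean, lean_sem, lean_n)
    print(round(t, 2), round(df, 2))
```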

[289] ALDH1A3 was over-expressed in L6 myotubes and the effect on basal
and insulin stimulated glucose transport was determined.

Glucose Transport Analysis in L6 Myotubes

0.057    1.03    0.634    1.15±0.08

Legend: "Con" indicates control L6 myotubes that do not express hALDH1A3. "FC"
indicates the fold change defined as the following ratio: glucose transport in
hALDH1A3-expressing cells/glucose transport in non-hALDH1A3-expressing cells. "h" is human. "n" is
the number of experiments. SEM is the standard error of the mean.

[290] The results show that increased levels of ALDH1A3 in a cell such as a
muscle cell leads to a corresponding increase in glucose uptake. This indicates that
increasing the levels or activity of ALDH1A3 in tissues of insulin resistant patients or
diabetic patients will increase the ability of such tissues to take up glucose and hence, will
provide an effective treatment for insulin resistance and diabetes.
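
The "FC" ratio used in the glucose transport legends, summarized over repeated experiments as a mean with its SEM, can be sketched as below. The transport values and function names are hypothetical, and pairing each expressing-cell measurement with the control from the same experiment is an assumption about how the ratios were formed.

```python
from math import sqrt
from statistics import mean, stdev


def transport_fold_changes(expressing, control):
    """Per-experiment FC: transport in expressing cells / transport in control cells."""
    if len(expressing) != len(control):
        raise ValueError("one control measurement is needed per experiment")
    return [e / c for e, c in zip(expressing, control)]


def mean_and_sem(values):
    """Mean, standard error of the mean, and number of experiments."""
    n = len(values)
    return mean(values), stdev(values) / sqrt(n), n


if __name__ == "__main__":
    expressing = [12.0, 11.0, 13.0]  # hypothetical glucose transport readings
    control = [10.0, 10.0, 10.0]     # hypothetical readings in control myotubes
    fc_mean, fc_sem, n = mean_and_sem(transport_fold_changes(expressing, control))
    print(f"FC = {fc_mean:.2f} ± {fc_sem:.2f} (n = {n})")
```

An FC above 1 corresponds to increased glucose transport in the expressing cells and an FC below 1 to decreased transport.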

[291] ALDH1A3 contains the following protein domains (designated with
reference to SEQ ID NO:8): Aldehyde dehydrogenase family (PF00171) at amino acids 40 to
507. ALDH1A3 is a retinaldehyde dehydrogenase that catalyzes the oxidation of
all-trans-retinaldehyde to retinoic acid and may have a role in cell differentiation and proliferation
(Grun, R., et al., J Biol Chem. 275: 41210-8 (2000); Rexer, B. N., et al., Cancer Res. 61:
7065-7070 (2001)).

[292] It has been established that the mRNA for ALDH1A3 can be induced
in hepatocytes by agents such as omeprazole (Nishimura et al., Yakugaku Zasshi 122: 339-61
(2002)). Thus, an exemplary method in which ALDH1A3 activators can be identified
comprises treating hepatocytes with candidate compounds and measuring increases in
ALDH1A3 mRNA.



ALK7 



[293] Probe set MBXHUMFAT04495 detects ALK7 nucleic acid sequences. 
Expression of ALK7 transcripts was decreased in diabetic compared to lean patients in the 
gene profiling experiment. 

"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.






[294] ALK7 was also evaluated using real-time PCR. The results further
show that ALK7 is significantly under-expressed in subcutaneous adipose from diabetic
individuals when compared to subcutaneous adipose from lean individuals.

Comparison                  Expression Fold Change    t test
Diabetic (10) / Lean (8)    0.65                      0.017

"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[295] Probe set MBXHUMFAT04495 detects ALK7 nucleic acid sequences.
Expression of ALK7 transcripts was decreased in patients with insulin resistance compared to
normal patients in the gene profiling experiment.

between glucose disposal rate (Rd) and signal intensities. A positive co-efficient indicates
down regulation whereas a negative co-efficient indicates up regulation of the gene with
increasing insulin resistance; "n" indicates number of patient samples.

[296] ALK7 was also evaluated using real-time PCR. The results further
show that ALK7 is significantly decreased in patients with insulin resistance compared to
normal patients.



Comparison                  Expression Fold Change
Correlation to Rd (18)      0.534                     <0.02

A positive co-efficient indicates down regulation whereas a negative co-efficient indicates up regulation
of the gene with increasing insulin resistance. Numbers in parentheses indicate the number
of patient samples analyzed by real-time PCR.
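
The relationship between glucose disposal rate (Rd) and expression can be illustrated with a correlation coefficient. The specification does not say which coefficient was used; Pearson's r is assumed here, and the function names and data are hypothetical. Because a lower Rd reflects greater insulin resistance, a positive coefficient means expression falls as insulin resistance increases.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length (at least 2 points)")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def regulation_with_insulin_resistance(r):
    """Interpret the sign of the Rd/expression coefficient as in the legend."""
    if r > 0:
        return "down regulated with increasing insulin resistance"
    if r < 0:
        return "up regulated with increasing insulin resistance"
    return "no linear trend"


if __name__ == "__main__":
    rd = [4.0, 6.0, 8.0, 10.0, 12.0]            # hypothetical Rd values
    signal = [50.0, 65.0, 60.0, 80.0, 95.0]     # hypothetical signal intensities
    r = pearson_r(rd, signal)
    print(round(r, 3), regulation_with_insulin_resistance(r))
```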

[297] Probe set MBXHUMFAT04495 detects ALK7 nucleic acid sequences. 
Expression of ALK7 transcripts was decreased in obese compared to lean patients in the gene 
profiling experiment. 

"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.

[298] Probe set MBXHUMFAT04495 detects ALK7 nucleic acid sequences.
Expression of ALK7 transcripts was increased in adipose tissues compared to all other human
adult tissues in the gene profiling experiment.

"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n"
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled.

[299] ALK7 was over-expressed in L6 myotubes and the effect on basal and
insulin stimulated glucose transport was determined.

Glucose Transport Analysis in L6 Myotubes

"FC" indicates the fold change defined as the following ratio: glucose transport in hALK7-expressing L6
myotubes/glucose transport in non-ALK7-expressing L6 myotubes. "h" is human. "n" is the
number of experiments. SD is the standard deviation.

[300] The results show that increased levels of ALK7 in a cell such as a
muscle cell leads to a corresponding decrease in glucose uptake. This indicates that
decreasing the levels or activity of ALK7 in tissues of insulin resistant patients or diabetic
patients will increase the ability of such tissues to take up glucose and hence, will provide an
effective treatment for insulin resistance and diabetes.

[301] ALK7 contains the following protein domains (designated with
reference to SEQ ID NO:14): Signal peptide at amino acids 1 to 25; Activin types I and II
receptor domain (PF01064) at amino acids 15 to 100; Protein kinase domain (PF00069) at
amino acids 195 to 482; u-PAR/Ly-6 domain (PF00021) at amino acids 30 to 94; and 1
transmembrane domain (TMHMM2.0) at amino acids 114 to 136. ALK7 is a transmembrane
receptor protein serine-threonine kinase for the transforming growth factor-beta (TGF-beta)
superfamily related growth factors and signals through SMAD2 (Bondestam, J., et al.,
Cytogenet. Cell Genet. 95: 157-162 (2001)). Nodal was identified as the ligand for ALK7.
ALK7 may play a role in proliferation and apoptosis (Munir, S., et al., J Biol Chem. May 18
Epub (2004); Jornvall, H., et al., J Biol Chem. 276: 5140-6 (2001)).

[302] ALK7 is a type I serine/threonine kinase receptor of the transforming
growth factor (TGF)-beta family. Signalling from the ALK7 receptor involves
phosphorylation of SMAD2 and SMAD3 (see, e.g., Kim, J., et al., J. Biol. Chem. 279:
28458-28465 (2004)). Inhibitors of ALK7 kinase activity can thus be identified, for example, by
using an in vitro phosphorylation assay containing recombinant ALK7 incubated with
recombinant SMAD2 or SMAD3 and radio-labelled ATP.
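
One way the read-out of such a phosphorylation assay could be scored is sketched below. Everything in it is an illustrative assumption rather than part of the specification: the use of radioactive counts as the signal, the no-inhibitor and background control wells, the 50% threshold, and the function and compound names.

```python
def percent_inhibition(signal_with_compound, signal_no_inhibitor, signal_background):
    """Percent loss of substrate phosphorylation relative to the uninhibited control."""
    window = signal_no_inhibitor - signal_background
    if window <= 0:
        raise ValueError("uninhibited signal must exceed background")
    return 100.0 * (1.0 - (signal_with_compound - signal_background) / window)


def call_inhibitors(readings, signal_no_inhibitor, signal_background,
                    min_inhibition=50.0):
    """Names of compounds whose percent inhibition meets the (assumed) threshold."""
    return [name for name, signal in readings.items()
            if percent_inhibition(signal, signal_no_inhibitor,
                                  signal_background) >= min_inhibition]


if __name__ == "__main__":
    # Hypothetical counts of radio-label incorporated into SMAD2 per well.
    readings = {"compound_A": 1500.0, "compound_B": 4800.0}
    print(call_inhibitors(readings, signal_no_inhibitor=5000.0,
                          signal_background=1000.0))
```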

[303] Inhibitors of ALK7 kinase activity such as SB 505124
(2-(5-benzo[1,3]dioxol-5-yl-2-tert-butyl-3H-imidazol-4-yl)-6-methylpyridine hydrochloride) (e.g., Byfield
et al., Mol. Pharmacol. 65: 744-752 (2004)) and SB 431542
(4-(5-benzo[1,3]dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)-benzamide) (Inman et al., Mol. Pharmacol. 62: 65-74 (2002))
are known. Such inhibitors as well as other ALK7 kinase inhibitors, e.g., identified using
screening assays as described herein, can be used to treat insulin resistance and diabetes.



C3AR1 

[304] Probe set 209906 detects C3AR1 nucleic acid sequences. Expression
of C3AR1 transcripts was increased in diabetic compared to lean patients in the gene 
profiling experiment. 

B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.

[305] C3AR1 was also evaluated using real-time PCR. The results further
show that C3AR1 is significantly over-expressed in subcutaneous adipose from diabetic
individuals when compared to subcutaneous adipose from lean individuals.



Comparison                  Expression Fold Change    t test
Diabetic (10) / Lean (8)    1.89                      0.008

"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[306] Probe set 209906 detects C3AR1 nucleic acid sequences. Expression
of C3AR1 transcripts was increased in obese compared to lean patients in the gene profiling
experiment.

B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.

[307] The cellular level of C3AR1 was reduced in 3T3-L1 adipocytes using
siRNA directed against C3AR1 and the effect on basal and insulin stimulated glucose
transport was determined.

murine C3AR1. "Scr" indicates the Dharmacon Scramble siRNA Control. "FC" indicates the
fold change defined as the following ratio: Level of C3AR1 mRNA in C3AR1 siRNA
transfected 3T3-L1 adipocytes/Level of C3AR1 mRNA in Scramble siRNA transfected
3T3-L1 adipocytes. "n" is the number of experiments. SEM is the standard error of the mean.

Glucose Transport in 3T3-L1 Adipocytes Transfected with siRNA Oligonucleotides

Legend: "siRNA" indicates Dharmacon Smartpool siRNA oligonucleotides directed
against murine C3AR1. "Scr" indicates Dharmacon Scramble siRNA Control
oligonucleotides. "FC" indicates the fold change defined as the following ratio: glucose
transport in C3AR1 siRNA transfected 3T3-L1 adipocytes/glucose transport in Scramble
siRNA transfected 3T3-L1 adipocytes. "n" is the number of experiments. SEM is the
standard error of the mean.

[308] These results show that decreasing the levels of C3AR1 in a cell such
as an adipocyte leads to a corresponding decrease in glucose uptake. This indicates that
increasing the levels or activity of C3AR1 in tissues of insulin resistant patients or diabetic
patients will increase the ability of such tissues to take up glucose and hence, will provide an
effective treatment for insulin resistance and diabetes.

[309] C3AR1 contains the following protein domains (designated with
reference to SEQ ID NO:20): 7 transmembrane receptor (rhodopsin family) (PF00001) at
amino acids 40 to 435; and 7 transmembrane domains (TMHMM2.0) at amino acids 24 to 46,
59 to 81, 96 to 118, 138 to 160, 338 to 360, 380 to 402, 417 to 439. C3AR1 is the G
protein-coupled receptor for complement component 3a and mediates various aspects of
inflammatory responses including complement activation and chemotaxis (Fischer, W. H. and
Hugli, T. E., J Immunol 159: 4279-4286 (1997); Zwirner, J., et al., Eur J Immunol 28:
1570-7 (1998); Crass, T., et al., Eur J Immunol 26: 1944-1950 (1996)).

[310] C3AR1 is a G protein coupled receptor, activation of which results in
the release of intracellular Ca2+ in HMC-1 cells (see, e.g., Legler, D.F. et al., Eur J Immunol
26: 753-758 (1996)). Agonists of the C3AR1 can therefore be identified, for example, using
assays that measure changes in intracellular calcium. An exemplary assay is a cell based
assay in which cells over-expressing C3AR1, such as HMC-1 cells, are treated with
compounds and an increase in intracellular Ca2+ is measured using Ca2+ sensitive dyes such
as Calcium 3.
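
A simple way to score such a calcium assay is sketched below, under stated assumptions: each well yields a baseline and a peak dye fluorescence reading, the response is expressed as peak over baseline, and a 1.5 fold cut-off flags candidate agonists. The cut-off, the function names and the readings are hypothetical and are not taken from the specification.

```python
def fold_increase(baseline_fluorescence, peak_fluorescence):
    """Peak dye fluorescence after compound addition, relative to baseline."""
    if baseline_fluorescence <= 0:
        raise ValueError("baseline fluorescence must be positive")
    return peak_fluorescence / baseline_fluorescence


def candidate_agonists(wells, min_fold=1.5):
    """Sorted names of wells whose Ca2+ signal rises by at least min_fold."""
    return sorted(name for name, (baseline, peak) in wells.items()
                  if fold_increase(baseline, peak) >= min_fold)


if __name__ == "__main__":
    # Hypothetical (baseline, peak) fluorescence readings per treated well.
    wells = {"vehicle": (100.0, 105.0), "compound_X": (100.0, 240.0)}
    print(candidate_agonists(wells))
```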






CALCRL 

[311] Probe set 210815 detects CALCRL nucleic acid sequences. Expression
of CALCRL transcripts was decreased in diabetic compared to lean patients in the gene
profiling experiment.

"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.

[312] CALCRL was also evaluated using real-time PCR. The results further
show that CALCRL is significantly under-expressed in subcutaneous adipose from diabetic
individuals when compared to subcutaneous adipose from lean individuals.

Comparison                  Expression Fold Change    t test
Diabetic (10) / Lean (8)    0.65                      0.001

"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.



[313] CALCRL was over-expressed in 3T3-L1 adipocytes and the effect on
basal and insulin stimulated glucose transport was determined.

Glucose Transport Analysis in 3T3-L1 Adipocytes:

"FC" indicates the fold change defined as the following ratio: glucose transport in
hCALCRL-expressing 3T3-L1 adipocytes/glucose transport in non-hCALCRL expressing 3T3-L1
adipocytes. "h" is human. "n" is the number of experiments. SEM is the standard error of the
mean.

[314] The results show that increasing the levels of CALCRL in a cell such
as an adipocyte leads to a corresponding increase in glucose uptake. This indicates that
increasing the levels or activity of CALCRL in tissues of insulin resistant patients or diabetic
patients will increase the ability of such tissues to take up glucose and hence, will provide
an effective treatment for insulin resistance and diabetes.

[315] CALCRL contains the following protein domains (designated with
reference to SEQ ID NO:26): Signal peptide at amino acids 1 to 22; Hormone receptor
domain (PF02793) at amino acids 62 to 132; 7 transmembrane receptor (Secretin family)
(PF00002) at amino acids 138 to 391; and 7 transmembrane domains (TMHMM2.0) at amino
acids 144 to 166, 179 to 198, 225 to 247, 254 to 276, 291 to 313, 334 to 352, 367 to 389.
CALCRL is the G protein-coupled receptor which binds calcitonin-gene-related peptide
(CGRP) or adrenomedullin (ADM) depending upon interaction with either of the accessory
proteins, RAMP1 and RAMP2, and stimulates adenylyl cyclase (Kamitani, S., et al., FEBS
Lett. 448: 111-114 (1999); Kuwasako, K., et al., Mol Pharmacol. 65: 207-13 (2004);
Flahaut, M., et al., Biochemistry. 42: 10333-41 (2003)).

CCL13

[316] Probe set 206407 detects CCL13 nucleic acid sequences. Expression of
CCL13 transcripts was increased in diabetic compared to lean patients in the gene profiling
experiment.

"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.

[317] Probe set 206407 detects CCL13 nucleic acid sequences. Expression of
CCL13 transcripts was increased in obese compared to lean patients in the gene profiling
experiment.

B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.

[318] CCL13 was also evaluated using real-time PCR. The results further
show that CCL13 is significantly over-expressed in subcutaneous adipose from obese
individuals when compared to subcutaneous adipose from lean individuals.



Comparison              Expression Fold Change    t test
Obese (5) / Lean (4)    2.88                      0.048

"Fold Change" indicates the fold expression calculated as the ratio of the mean obese
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.



[319] CCL13 contains the following protein domains (designated with
reference to SEQ ID NO:32): Signal peptide at amino acids 1 to 23; and Small cytokines
(intecrine/chemokine), interleukin-8 like (PF00048) at amino acids 24 to 89. A soluble active
secreted form of CCL13 has been detected (Berkhout, T.A., et al., J Biol Chem. 272:
16404-13 (1997)) and this is displayed in SEQ ID NO:33. CCL13 displays chemotactic activity for
monocytes, lymphocytes, basophils and eosinophils, but not neutrophils. This chemokine
plays a role in accumulation of leukocytes during inflammation. It may also be involved in
the recruitment of monocytes into the arterial wall during atherosclerosis (Garcia-Zepeda, E.
A. et al., J Immunol 157: 5613-5626 (1996); White, J. R. et al., J Biol Chem 275:
36626-36631 (2000); Wain, J.H., et al., Clin Exp Immunol 127: 436-44 (2002)).

CCL8 

[320] Probe set 214038 detects CCL8 nucleic acid sequences. Expression of
CCL8 transcripts was decreased in pio compared to vehicle treated cultures of primary
human adipocytes in the gene profiling experiment.

B/C indicates sample is from Basal or Clamp; "Pre-Pio" and "Post-Pio" indicate sample was
taken before or after 24 hours of pioglitazone treatment; "Mean Expr" indicates mean
expression; "SEM" indicates standard error of mean; "n" indicates number of patient
samples; "Fold Change" indicates fold change of primary human adipocytes post-pio in
comparison to pre-pio samples.

[321] CCL8 was also evaluated using real-time PCR. The results further
show that CCL8 is significantly under-expressed in primary cultured human adipocytes
treated with pio when compared to vehicle.

<0.000

"Fold Change" indicates the fold expression calculated as the ratio of the mean pio
expression/ mean vehicle expression. Numbers in parentheses indicate the number of
primary human adipocyte samples analyzed by real-time PCR.



[322J Probe set 214038 detects CCL8 nucleic acid sequences. Expression of 
CCL8 transcripts was decreased in rosi compared to vehicle treated cultures of primary 
25 human adipocytes in the gene profiling experiment. 
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B/C indicates sample is from Basal or Clamp; "Pre-Rosi" and "Post-Rosi" indicates sample 
was taken before or after 24 hours of rosiglitazone treatment; "Mean Expr" indicates mean 
expression; "SEM" indicates standard error of mean; "n" indicates number of patient 
samples; "Fold Change" indicates fold change of primary human adipocytes post-rosi in 
comparison to pre-rosi samples. 

[323] CCL8 was also evaluated using real-time PCR. The results further
show that CCL8 is significantly under-expressed in primary cultured human adipocytes
treated with rosi when compared to vehicle.



Comparison                        Expression Fold Change    ttest
Post-Rosi (12) / Pre-Rosi (12)    0.42                      <0.000

"Fold Change" indicates the fold expression calculated as the ratio of the mean rosi
expression/ mean vehicle expression. Numbers in parentheses indicate the number of
primary human adipocyte samples analyzed by real-time PCR.

[324] Probe set 214038 detects CCL8 nucleic acid sequences. Expression of 
CCL8 transcripts was increased in adipose tissues compared to all other human adult tissues 
in the gene profiling experiment. 
"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n"
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in 
comparison to all other human adult tissues profiled. 

[325] CCL8 contains the following protein domains (designated with
reference to SEQ ID NO:39): Signal peptide at amino acids 1 to 23; and Small cytokines
(intecrine/chemokine), interleukin-8 like (PF00048) at amino acids 24 to 90. A soluble active
secreted form of CCL8 has been detected (Van Damme, J. et al., J Exp Med. 176: 59-65
(1992)) and this is displayed in SEQ ID NO:40. CCL8 displays chemotactic activity for
monocytes, lymphocytes, basophils and eosinophils. By recruiting leukocytes to sites of
inflammation this cytokine may contribute to tumor-associated leukocyte infiltration and to
the antiviral state against HIV infection (Noso, N. et al., Biochem. Biophys. Res. Commun.
200: 1470-1476 (1994); Yang, O. et al., J. Infect. Dis. 185: 1174-1178 (2002)).
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CHI3L1 

[326] Probe set 209395 detects CHI3L1 nucleic acid sequences. Expression 
of CHI3L1 transcripts was increased in diabetic compared to lean patients in the gene 
profiling experiment. 
B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.

[327] CHI3L1 was also evaluated using real-time PCR. The results further 
show that CHI3L1 is significantly over-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                  Expression Fold Change    ttest
Diabetic (10) / Lean (8)    3.38                      0.037

"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.
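
By way of illustration only, the "Fold Change" and "SEM" quantities defined in the legends can be computed as in the following minimal Python sketch. The function names and the expression values are hypothetical and are not data from the experiments described herein; the sketch assumes that a relative expression value has already been obtained for each sample.

```python
from math import sqrt
from statistics import mean, stdev


def sem(values):
    """Standard error of the mean: sample standard deviation / sqrt(n)."""
    if len(values) < 2:
        raise ValueError("at least two samples are needed to estimate SEM")
    return stdev(values) / sqrt(len(values))


def fold_change(group_a, group_b):
    """Ratio of the mean expression of group_a to the mean expression of group_b."""
    return mean(group_a) / mean(group_b)


# Hypothetical relative expression values, for illustration only.
diabetic = [3.0, 5.0, 4.0, 4.0]
lean = [1.0, 3.0, 2.0, 2.0]

print(fold_change(diabetic, lean))  # -> 2.0
print(sem(lean))
```

A fold change above 1 corresponds to over-expression in the first group and a value below 1 to under-expression. The significance test reported in the "ttest" column is not reproduced in this sketch.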

[328] Probe set 209395 detects CHI3L1 nucleic acid sequences. Expression
of CHI3L1 transcripts was increased in obese compared to lean patients in the gene profiling 
experiment. 
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B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression; 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of obese in comparison to lean patients. 

[329] Probe set 209395 detects CHI3L1 nucleic acid sequences. Expression 
of CHI3L1 transcripts was decreased in pio compared to vehicle treated cultures of primary 
human adipocytes in the gene profiling experiment. 
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taken before or after 24 hours of pioglitazone treatment; "Mean Expr" indicates mean 
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expression; "SEM" indicates standard error of mean; "n" indicates number of patient 
samples; "Fold Change" indicates fold change of primary human adipocytes post-pio in
comparison to pre-pio samples. 

[330] CHI3L1 was also evaluated using real-time PCR. The results further
show that CHI3L1 is significantly under-expressed in primary cultured human adipocytes
treated with pio when compared to vehicle. 



Comparison                      Expression Fold Change    ttest
Post-Pio (12) / Pre-Pio (12)    0.22                      <0.000

"Fold Change" indicates the fold expression calculated as the ratio of the mean pio
expression/ mean vehicle expression. Numbers in parentheses indicate the number of
primary human adipocyte samples analyzed by real-time PCR.



[331] Probe set 209395 detects CHI3L1 nucleic acid sequences. Expression
of CHI3L1 transcripts was decreased in rosi compared to vehicle treated cultures of primary
human adipocytes in the gene profiling experiment. 
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B/C indicates sample is from Basal or Clamp; "Pre-Rosi" and "Post-Rosi" indicates sample 
was taken before or after 24 hours of rosiglitazone treatment; "Mean Expr" indicates mean
expression; "SEM" indicates standard error of mean; "n" indicates number of patient
samples; "Fold Change" indicates fold change of primary human adipocytes post-rosi in 
comparison to pre-rosi samples. 

[332] CHI3L1 was also evaluated using real-time PCR. The results further 
show that CHI3L1 is significantly under-expressed in primary cultured human adipocytes
treated with rosi when compared to vehicle.



Comparison                        Expression Fold Change    ttest
Post-Rosi (12) / Pre-Rosi (12)    0.25                      <0.000

"Fold Change" indicates the fold expression calculated as the ratio of the mean rosi
expression/ mean vehicle expression. Numbers in parentheses indicate the number of
primary human adipocyte samples analyzed by real-time PCR.



[333] CHI3L1 was over-expressed in L6 myotubes and the effect on basal
and insulin stimulated glucose transport was determined.
Glucose Transport Analysis in L6 Myotubes
Columns: Insulin (nM); Experiment 1; Experiment 2; Experiment 3
Legend: "Con" indicates control L6 myotubes that do not express CHI3L1. "FC" indicates
the fold change defined as the following ratio: glucose transport in CHI3L1-expressing
cells/glucose transport in non-CHI3L1-expressing cells. "h" is human, "n" is the number of
experiments. SEM is the standard error of the mean.

[334] The results show that increasing the levels of CHI3L1 in a cell such as
a muscle cell leads to a corresponding increase in glucose uptake. This indicates that
increasing the levels or activity of CHI3L1 in tissues of insulin resistant patients or diabetic
patients will increase the ability of such tissues to take up glucose and hence, will provide an
effective treatment for insulin resistance and diabetes. 

[335] CHI3L1 contains the following protein domains (designated with
reference to SEQ ID NO:46): Signal peptide at amino acids 1 to 21; and Glycosyl hydrolases
family 18 (PF00704) at amino acids 22 to 357. A soluble active secreted form of CHI3L1 has
been detected (Hakala, B.E., et al., J Biol Chem. 268:25803-10 (1993)) and this is displayed
in SEQ ID NO:47. CHI3L1 is a glycoprotein secreted by a variety of cells including articular
chondrocytes, synoviocytes and macrophages (Recklies, A.D., et al., Biochem J. 365: 119-26
(2002)) and is associated with conditions of increased matrix turnover and tissue remodeling,
for example, arthritis (Punzi, L., et al., Ann Rheum Dis. 62: 1224-6 (2003); Ling, H. and
Recklies, A.D., Biochem J. Mar 12;Pt. Epub (2004)).

[336] The CHI3L1 gene has been cloned and the proximal promoter has been
identified and shown to contain binding sites for transcription factors such as PU.1, Sp1, Sp3,
USF, AML-1 and C/EBP proteins (see, e.g., Rehli, M., et al., Genomics. 43: 221-225 (1997);
Rehli, M., et al., J Biol Chem 278: 44058-44067 (2003)). An exemplary method of screening
for CHI3L1 regulators comprises an assay as follows: A CHI3L1 promoter is inserted
upstream of a reporter gene such as β-galactosidase and expressed in cells. Compounds that
up-regulate the activity of the promoter can thus be identified by measuring increased
β-galactosidase activity.
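
For illustration only, the read-out of such a reporter screen might be analyzed as in the following minimal Python sketch. The function name, the compound labels, the readings and the two-fold cut-off are all hypothetical assumptions chosen for the example; the present description does not specify a particular threshold.

```python
from statistics import mean


def reporter_hits(vehicle_readings, compound_readings, min_fold=2.0):
    """Return compounds whose mean reporter activity is at least min_fold
    times the mean activity of vehicle-treated cells.

    vehicle_readings: reporter activity values from vehicle-treated wells.
    compound_readings: dict mapping a compound label to its replicate readings.
    min_fold: hypothetical cut-off, an assumption made for this sketch.
    """
    baseline = mean(vehicle_readings)
    if baseline <= 0:
        raise ValueError("vehicle baseline must be positive")
    hits = {}
    for name, readings in compound_readings.items():
        fold = mean(readings) / baseline
        if fold >= min_fold:
            hits[name] = fold
    return hits


# Hypothetical readings (arbitrary units), for illustration only.
vehicle = [1.0, 1.0]
compounds = {"cmpd_A": [3.0, 3.0], "cmpd_B": [1.0, 1.2]}
print(reporter_hits(vehicle, compounds))  # -> {'cmpd_A': 3.0}
```

In practice the cut-off would be chosen with reference to the variability of the vehicle wells, and candidate compounds would be confirmed in replicate experiments.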
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CR1 

[337] Probe set 244313 detects CR1 nucleic acid sequences. Expression of 
CR1 transcripts was increased in diabetic compared to lean patients in the gene profiling 
experiment. 
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"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of diabetics in comparison to lean patients. 

[338] CR1 was also evaluated using real-time PCR. The results further show 
that CR1 is significantly over-expressed in subcutaneous adipose from diabetic individuals 
when compared to subcutaneous adipose from lean individuals. 



Comparison                  Expression Fold Change    ttest
Diabetic (10) / Lean (8)    1.96                      0.004

"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[339] Probe set 244313 detects CR1 nucleic acid sequences. Expression of 
CR1 transcripts was increased in obese compared to lean patients in the gene profiling 
experiment. 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients. 

[340] CR1 contains the following protein domains (designated with reference
to SEQ ID NO:53): Sushi domain (SCR repeat) (PF00084) at amino acids 43 to 99, 104 to
161, 166 to 232, 238 to 293, 297 to 353, 358 to 416, 421 to 487, 493 to 549, 554 to 611, 616
to 682, 688 to 743, 747 to 803, 808 to 866, 871 to 937, 943 to 999, 1004 to 1061, 1066 to 
1132, 1138 to 1193, 1197 to 1253, 1258 to 1316, 1321 to 1387, 1393 to 1449, 1454 to 1511, 
1516 to 1582, 1588 to 1643, 1647 to 1703, 1708 to 1766, 1771 to 1837, 1846 to 1902, 1907 
to 1964, 1969 to 2035, 2041 to 2096, 2100 to 2156, 2161 to 2219, 2224 to 2290, 2298 to 
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2354, 2359 to 2415 and 1 transmembrane domain (TMHMM2.0) at amino acids 2447 to 2489. 
Complement receptor 1 (CR1) is a cell surface glycoprotein on erythrocytes, leukocytes, and
other cells that inhibits both the classical and alternative pathways of complement
activation. CR1 also mediates other key immunological
functions such as the transport of C3b-coated immune complexes in erythrocytes, activation
of phagocytosis of C3b-bearing particles by neutrophils and monocytes, induction of
interleukin 1 secretion by monocytes and enhancement of B-cell differentiation (Hamer, I., et
al., Biochem. J. 329: 183-190 (1998); Makrides, S. C. et al., J. Biol. Chem. 267: 24754-24761
(1992)).



CSFR1

[341] Probe set 203104 detects CSFR1 nucleic acid sequences. Expression of
CSFR1 transcripts was increased in diabetic compared to lean patients in the gene profiling 
experiment. 
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B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression; 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of diabetics in comparison to lean patients. 

[342] CSFR1 was also evaluated using real-time PCR. The results further
show that CSFR1 is significantly over-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                  Expression Fold Change    ttest
Diabetic (10) / Lean (8)    1.68                      0.009

"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[343] Probe set 203104 detects CSFR1 nucleic acid sequences. Expression of
CSFR1 transcripts was increased in obese compared to lean patients in the gene profiling 
experiment. 
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B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression; 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of obese in comparison to lean patients. 



[344] CSF1R was over-expressed in 3T3-L1 adipocytes and cells were then 
treated with CSF1. The effect on basal and insulin stimulated glucose transport and Glut4
translocation was then determined.
Glucose Transport Analysis in 3T3-L1 Adipocytes: 
Legend: "Con" indicates control 3T3-L1 adipocytes that do not express CSF1R. "FC"
indicates the fold change defined as the following ratio: glucose transport in hCSF1R-expressing
cells stimulated with 100 ng/ml hCSF1 for 24 hours/glucose transport in non-hCSF1R-expressing
cells stimulated with 100 ng/ml hCSF1 for 24 hours. "h" is human, "n"
is the number of experiments. SEM is the standard error of the mean.



Glut4 Translocation Analysis:
Legend: "Fold Change" indicates the following ratio: (Mean % of hCSF1R-expressing cells
incubated with 100 ng/mL murine CSF1 for 24 hours that were scored positive for cell
surface Glut4)/(Mean % of LacZ-expressing cells incubated with 100 ng/mL murine
CSF1 for 24 hours that were scored positive for cell surface Glut4). "h" is human, "n" is the
number of experiments.

[345] The results show that increasing the levels of CSF1R in a cell such as an
adipocyte leads to a corresponding increase in glucose uptake. This indicates that increasing
the levels or activity of CSF1R in tissues of insulin resistant patients or diabetic patients will
increase the ability of such tissues to take up glucose and hence, will provide an effective
treatment for insulin resistance and diabetes. 

[346] CSFR1 contains the following protein domains (designated with
reference to SEQ ID NO:57): Signal peptide at amino acids 1 to 19; Protein kinase domain
(PF00069) at amino acids 582 to 910; two immunoglobulin domains (PF00047) at amino acids
217 to 280, 412 to 487 and 1 transmembrane domain (TMHMM2.0) at amino acids 515 to
537. CSFR1 is the tyrosine kinase receptor for colony stimulating factor 1, a cytokine which
controls the production, differentiation, and function of macrophages and may be associated
with advanced-stage breast carcinoma and myeloid leukemia (Sapi, E., Exp Biol Med.
229:1-11 (2004); Boultwood, J. et al., Proc Natl Acad Sci USA 88: 6176-6180 (1991); Sapi, E. et
al., Cancer Res. 59: 5578-85 (1999); Fixe, P. and Praloran, V. Cytokine 10:32-7 (1998)).

[347] Cells over-expressing CSF1R can be generated (see, e.g., Murray, L.J.,
et al., Clin Exp Metastasis. 20: 757-66 (2003)). Agonists of the CSF1R can be identified, e.g.,
by screening such cells for compounds that have the ability to induce autophosphorylation of
the CSF1R.



CTSK 

[348] Probe set 202450 detects CTSK nucleic acid sequences. Expression of 
CTSK transcripts was increased in diabetic compared to lean patients in the gene profiling 
experiment. 
B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients. 

[349] CTSK was also evaluated using real-time PCR. The results further 
show that CTSK is significantly over-expressed in subcutaneous adipose from diabetic
individuals when compared to subcutaneous adipose from lean individuals.



Comparison                  Expression Fold Change    ttest
Diabetic (10) / Lean (8)    1.68                      <0.000

"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.
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[350] Probe set 202450 detects CTSK nucleic acid sequences. Expression of 
CTSK transcripts was increased in patients with insulin resistance compared to normal 
patients in the gene profiling experiment. 
B/C indicates sample is from Basal or Clamp; "Corr Co-efficient" indicates the relationship
between glucose disposal rate (Rd) and signal intensities. A positive co-efficient indicates
down regulation whereas a negative co-efficient indicates up regulation of the gene with 
increasing insulin resistance; "n" indicates number of patient samples. 

[351] CTSK was also evaluated using real-time PCR. The results further
show that CTSK is significantly increased in patients with insulin resistance compared to
normal patients.



Comparison                Expression Fold Change    ttest
Correlation to Rd (26)    -0.74                     <0.005

"Corr Co-efficient" indicates the relationship between Rd and signal intensities. A positive
co-efficient indicates down regulation whereas a negative co-efficient indicates up regulation
of the gene with increasing insulin resistance. Numbers in parentheses indicate the number
of patient samples analyzed by real-time PCR.
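
As a non-limiting illustration of how a correlation co-efficient between Rd and signal intensity may be obtained and interpreted, the following minimal Python sketch computes a Pearson correlation co-efficient. The use of the Pearson form is an assumption made for the example, since the type of co-efficient is not stated here, and the function names and data values are hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation co-efficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def regulation_with_insulin_resistance(r):
    """Interpret the sign as in the legend: a lower Rd means greater insulin
    resistance, so a negative co-efficient indicates up regulation."""
    if r < 0:
        return "up"
    if r > 0:
        return "down"
    return "none"


# Hypothetical glucose disposal rates and signal intensities, illustration only.
rd = [2.0, 4.0, 6.0, 8.0]
signal = [8.0, 6.0, 4.0, 2.0]
r = pearson_r(rd, signal)
print(r, regulation_with_insulin_resistance(r))
```

Because Rd falls as insulin resistance increases, a gene whose signal rises as Rd falls yields a negative co-efficient, which is why negative values are read as up regulation.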



[352] Probe set 202450 detects CTSK nucleic acid sequences. Expression of
CTSK transcripts was increased in adipose tissues compared to all other human adult tissues
in the gene profiling experiment. 
"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n"
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled.

[353] CTSK contains the following protein domains (designated with
reference to SEQ ID NO:63): Signal peptide at amino acids 1 to 23; Outer membrane
lipoprotein LolB (PF03550) at amino acids 4 to 158; and Papain family cysteine protease
(PF00112) at amino acids 115 to 328. A soluble active secreted form of CTSK has been
detected and this is displayed in SEQ ID NO:64. CTSK is a cysteine (thiol) protease involved
in bone remodeling and resorption, acts as a collagenase towards cartilage proteoglycans
and may play a role in extracellular matrix degradation. Mutations in this gene are the cause
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of pycnodysostosis, an autosomal recessive disease characterized by osteosclerosis and short 
stature. (Motyckova, G. and Fisher, D.E., Curr Mol Med. 2: 407-21 (2002); Soderstrom, M.
et al., Biochim Biophys Acta 1446: 35-46 (1999); Hou, W.S., et al., Biol Chem. 384: 891-7
(2003)).

CXCR4

[354] Probe set 211919 detects CXCR4 nucleic acid sequences. Expression
of CXCR4 transcripts was increased in obese compared to lean patients in the gene profiling
experiment. 
B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients. 

[355] CXCR4 was also evaluated using real-time PCR. The results further
show that CXCR4 is significantly over-expressed in subcutaneous adipose from obese
individuals when compared to subcutaneous adipose from lean individuals.
Comparison              Expression Fold Change    ttest
Obese (5) / Lean (4)    2.79                      0.001

"Fold Change" indicates the fold expression calculated as the ratio of the mean obese
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[356] CXCR4 was over-expressed in 3T3-L1 adipocytes and the cells were
treated with SDF1, a ligand for CXCR4. The effects on basal and insulin stimulated glucose
transport and Glut 4 translocation were determined. 
Glucose Transport Analysis in 3T3-L1 Adipocytes: 
Legend: "Con" indicates control 3T3-L1 adipocytes that do not express hCXCR4. "FC"
indicates the fold change defined as the following ratio: glucose transport in hCXCR4-expressing
cells incubated for 24 hours with 20 nM SDF1/glucose transport in non-CXCR4-expressing
cells incubated for 24 hours with 20 nM SDF1. "h" is human, "n" is the number
of experiments. SEM is the standard error of the mean.



Glut4 Translocation Analysis: 



Insulin (nM)    Fold Change (Mean hCXCR4/Mean LacZ) (n=3)    ttest (hCXCR4 vs LacZ)
0               2.33                                         0.028
0.5             1.10                                         0.344
10              0.94                                         0.208

Legend: "Fold Change" indicates the following ratio: (Mean % of hCXCR4-expressing cells
incubated with 20 nM SDF1 for 24 hours that were scored positive for cell surface
Glut4)/(Mean % of LacZ-expressing cells incubated with 20 nM SDF1 for 24 hours that
were scored positive for cell surface Glut4). "h" is human, "n" is the number of experiments.

[357] The results show that in a cell such as an adipocyte, increasing the
levels of CXCR4 in the presence of the CXCR4 ligand SDF1 leads to a corresponding
increase in glucose uptake. This indicates that increasing the levels or activity of CXCR4 in
tissues of insulin resistant patients or diabetic patients will increase the ability of such tissues 
to take up glucose and hence, will provide an effective treatment for insulin resistance and 
diabetes. 

[358] CXCR4 contains the following protein domains (designated with
reference to SEQ ID NO:70): 7 transmembrane receptor (Secretin family) (PF00002) at
amino acids 45 to 280; and 7 transmembrane domains (TMHMM2.0) at amino acids 43 to 65,
78 to 96, 111 to 132, 155 to 174, 199 to 221, 242 to 264, 284 to 306. CXCR4 is a G
protein-coupled receptor that binds the CXC cytokine, CXCL12. CXCR4 may be required for
hematopoiesis and organ vascularization (Tachibana, K. et al., Nature 393: 591-4 (1998)). It is
known to act as a coreceptor for HIV (Moriuchi, M., et al., J. Immunol 159: 4322-4329
(1997)) and inhibition of this receptor may be therapeutic for invasive breast cancer
(Tamamura, H., et al., FEBS Lett. 550: 79-83 (2003)).

[359] Stimulation of the CXCR4 receptor with its ligand SDF1-alpha leads to
an increase in intracellular Ca2+ (see, e.g., Princen, K., et al., J Exp Med. 20;186: 1383-1388
(1997)). Agonists of the CXCR4 receptor can therefore be identified, e.g., by screening cells
with high levels of the CXCR4 receptor to identify compounds that increase intracellular
Ca2+ using a calcium sensitive dye such as Calcium 3 or Fluo 3.
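
Purely by way of example, a fluorescence trace from such a calcium-sensitive dye could be scored as in the following minimal Python sketch, which expresses the peak response as the change in fluorescence relative to baseline. The function names, the trace values, the number of baseline reads and the 0.5 cut-off are hypothetical assumptions made for the example and are not taken from the present description.

```python
def peak_response(trace, n_baseline):
    """Peak change in fluorescence relative to baseline, (Fmax - F0) / F0.

    trace: fluorescence readings over time for one well.
    n_baseline: number of initial readings taken before compound addition.
    """
    if not 0 < n_baseline < len(trace):
        raise ValueError("n_baseline must leave at least one post-addition read")
    f0 = sum(trace[:n_baseline]) / n_baseline
    if f0 <= 0:
        raise ValueError("baseline fluorescence must be positive")
    return (max(trace[n_baseline:]) - f0) / f0


def is_candidate_agonist(trace, n_baseline, threshold=0.5):
    """Flag a well whose peak response meets a hypothetical cut-off."""
    return peak_response(trace, n_baseline) >= threshold


# Hypothetical fluorescence readings (arbitrary units), for illustration only.
well = [100.0, 100.0, 100.0, 150.0, 250.0, 180.0]
print(peak_response(well, 3))  # -> 1.5
```

A compound that raises intracellular Ca2+ in cells expressing high levels of CXCR4, but not in control cells lacking the receptor, would be a candidate for follow-up.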
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DDAH2 

[360] Probe set 214909 detects DDAH2 nucleic acid sequences. Expression
of DDAH2 transcripts was increased in diabetic compared to lean patients in the gene 
profiling experiment. 
B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients. 

[361] Probe set 214909 detects DDAH2 nucleic acid sequences. Expression
of DDAH2 transcripts was increased in patients with insulin resistance compared to normal 
patients in the gene profiling experiment. 
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B/C indicates sample is from Basal or Clamp; "Corr Co-efficient" indicates the relationship 
between glucose disposal rate (Rd) and signal intensities. A positive co-efficient indicates 
down regulation whereas a negative co-efficient indicates up regulation of the gene with 
increasing insulin resistance; "n" indicates number of patient samples. 

[362] DDAH2 was also evaluated using real-time PCR. The results further 
show that DDAH2 is significantly increased in patients with insulin resistance compared to 
normal patients. 



Comparison                Expression Fold Change    ttest
Correlation to Rd (26)    -0.556                    <0.005

"Corr Co-efficient" indicates the relationship between Rd and signal intensities. A positive
co-efficient indicates down regulation whereas a negative co-efficient indicates up regulation
of the gene with increasing insulin resistance. Numbers in parentheses indicate the number
of patient samples analyzed by real-time PCR.

[363] Probe set 214909 detects DDAH2 nucleic acid sequences. Expression
of DDAH2 transcripts was increased in adipose tissues compared to all other human adult 
tissues in the gene profiling experiment. 
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled. 

[364] DDAH2 contains the following protein domains (designated with
reference to SEQ ID NO:76): Aminotransferase (PF02274) at amino acids 6 to 281.
DDAH2 regulates cellular methylarginine concentrations, which in turn inhibit nitric oxide
synthase. DDAH2 expression predominates in more highly vascularized tissues and in
immune tissues (Leiper, J.M., et al., Biochem J. 343: 209-14 (1999)).

DERP7 

[365] Probe set 219410 detects DERP7 nucleic acid sequences. Expression
of DERP7 transcripts was increased in obese compared to lean patients in the gene profiling 
experiment. 
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DERP7 
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"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of obese in comparison to lean patients. 

[366] DERP7 was also evaluated using real-time PCR. The results further
show that DERP7 is significantly over-expressed in subcutaneous adipose from obese 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison              Expression Fold Change    ttest
Obese (5) / Lean (4)    3.18                      0.074

"Fold Change" indicates the fold expression calculated as the ratio of the mean obese
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[367] Probe set 219410 detects DERP7 nucleic acid sequences. Expression
of DERP7 transcripts was decreased in pio compared to vehicle treated cultures of primary 
human adipocytes in the gene profiling experiment. 
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DERP7 
B/C indicates sample is from Basal or Clamp; "Pre-Pio" and "Post-Pio" indicates sample was
taken before or after 24 hours of pioglitazone treatment; "Mean Expr" indicates mean 
expression; "SEM" indicates standard error of mean; "n" indicates number of patient 
samples; "Fold Change" indicates fold change of primary human adipocytes post-pio in 
comparison to pre-pio samples. 

[368] DERP7 contains the following protein domains (designated with
reference to SEQ ID NO:82): Family of unknown function (DUF716) (PF04819) at amino
acids 113 to 251; and 7 transmembrane domains (TMHMM2.0) at amino acids 4 to 23, 44 to
66, 91 to 113, 118 to 140, 150 to 172, 181 to 203, 218 to 240. DERP7 has high similarity to
an uncharacterized mouse protein, p. 19.5. This is a putative membrane protein which was
shown to be differentially expressed in two closely related T lymphoma cell clones
(MacLeod, C.L., et al., Cell Growth Differ., 1(6): 271-279 (1990)).






ENDOGLYX1 

[369] Probe set 219091 detects ENDOGLYX1 nucleic acid sequences.
Expression of ENDOGLYX1 transcripts was increased in obese compared to lean patients in 
the gene profiling experiment. 
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B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression; 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of obese in comparison to lean patients. 

[370] ENDOGLYX1 was also evaluated using real-time PCR. The results
further show that ENDOGLYX1 is significantly over-expressed in subcutaneous adipose 
from obese individuals when compared to subcutaneous adipose from lean individuals. 



Comparison              Expression Fold Change    ttest
Obese (5) / Lean (4)    2.36                      0.018

"Fold Change" indicates the fold expression calculated as the ratio of the mean obese
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.
[371] Probe set 219091 detects ENDOGLYX1 nucleic acid sequences.
Expression of ENDOGLYX1 transcripts was increased in adipose tissues compared to all 
other human adult tissues in the gene profiling experiment. 
ENDOGLYX1

"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n"
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled. 

[372] ENDOGLYX1 contains the following protein domains (designated
with reference to SEQ ID NO:88): Seryl-tRNA synthetase N-terminal domain (PF02403) at
amino acids 499 to 600; Myosin tail (PF01576) at amino acids 143 to 814; Apolipoprotein
A1/A4/E family (PF01442) at amino acids 200 to 469; C1q domain (PF00386) at amino acids
827 to 946; TNF (Tumour Necrosis Factor) family (PF00229) at amino acids 835 to 946; and
Intermediate filament protein (PF00038) at amino acids 360 to 645. ENDOGLYX1 is a cell
surface glycoprotein which is attached to the extracellular matrix and capable of forming
homo- and heteromers via disulfide bonding. It may play a role in angiogenesis,
vasculogenesis, cell-matrix adhesion, and hemostasis (Christian, S. et al., J Biol Chem 276:
48588-95 (2001); Leimeister, C., et al., Dev Biol 249: 204-18 (2002)).

ETL 

[373] Probe set MBXHUMFAT01 286 detects ETL nucleic acid sequences. 
Expression of ETL transcripts was increased in obese compared to lean patients in the gene 
profiling experiment.
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ETL 



B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression; 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of obese in comparison to lean patients. 



[374] Probe set MBXHUMFAT01 286 detects ETL nucleic acid sequences. 
Expression of ETL transcripts was increased in adipose tissues compared to all other human
adult tissues in the gene profiling experiment. 
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ETL 



indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled. 

[375] ETL was also evaluated using real-time PCR. The results further show
that ETL is significantly over-expressed in adipose tissues when compared to all other human 
adult tissues. 



Comparison                                  Expression Fold Change    ttest
Fat Tissues (5) / All Other Tissues (13)    5.15                      0.007

tissues expression/ mean other tissues expression. Numbers in parentheses indicate the
number of human adult tissue samples analyzed by real-time PCR.

[376] ETL contains the following protein domains (designated with reference
to SEQ ID NO:94): BphX-like (PF06139) at amino acids 629 to 728; Latrophilin/CL-1-like
GPS domain (PF01825) at amino acids 487 to 539; EGF-like domain (PF00008) at amino
acids 183 to 220; 7 transmembrane receptor (Secretin family) (PF00002) at amino acids 545
to 792; and 7 transmembrane domains (TMHMM2.0) at amino acids 552 to 574, 587 to 606,
621 to 643, 655 to 677, 692 to 714, 740 to 762, 766 to 788. ETL belongs to the secretin
family of G-protein-coupled peptide hormone receptors and the EGF-TM7 subfamily of
receptors. The latter are characterized by a variable number of extracellular EGF and cell
surface domains and conserved seven transmembrane-spanning regions (Nechiporuk, T. et
al., J Biol Chem 276: 4150-7 (2001)).
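
The ETL domain list above can be cross-checked mechanically. The following is a minimal sketch, not part of the original disclosure, that encodes the Pfam 7 transmembrane receptor range and the seven TMHMM2.0 segments as reported for ETL and confirms that the segments are ordered, non-overlapping, and contained in the Pfam range. The variable and function names are illustrative assumptions, and the third segment is read as 621 to 643.

```python
# Residue ranges copied from the ETL domain description above
# (numbering relative to SEQ ID NO:94). Names are illustrative only.
pfam_7tm_region = (545, 792)
tm_segments = [
    (552, 574), (587, 606), (621, 643), (655, 677),
    (692, 714), (740, 762), (766, 788),
]


def is_inside(segment, region):
    """True if the segment lies entirely within the region (inclusive)."""
    return region[0] <= segment[0] and segment[1] <= region[1]


def is_ordered_without_overlap(segments):
    """True if each segment ends before the next one starts."""
    return all(a[1] < b[0] for a, b in zip(segments, segments[1:]))


segment_count = len(tm_segments)
all_inside = all(is_inside(s, pfam_7tm_region) for s in tm_segments)
ordered = is_ordered_without_overlap(tm_segments)

print(f"{segment_count} TM segments, inside Pfam range: {all_inside}, ordered: {ordered}")
```

The same check could be applied to the other seven-transmembrane receptors described in this section by substituting their reported ranges.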



FLJ12389



[377] Probe set 218434 detects FLJ12389 nucleic acid sequences. Expression
of FLJ12389 transcripts was decreased in diabetic compared to lean patients in the gene
profiling experiment.
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FLJ12389 
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B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.
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[378] FLJ12389 was also evaluated using real-time PCR. The results further
show that FLJ12389 is significantly under-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                Expression Fold Change    t test
Diabetic (10)/Lean (8)    0.19                      0.001



expression/ mean lean expression. Numbers in parentheses indicates the number of patient 
samples analyzed by real-time PCR. 

[379] Probe set 218434 detects FLJ12389 nucleic acid sequences. Expression
of FLJ12389 transcripts was decreased in obese compared to lean patients in the gene
profiling experiment.
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"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients. 

[380] FLJ12389 was also evaluated using real-time PCR. The results further
show that FLJ12389 is significantly under-expressed in subcutaneous adipose from obese
individuals when compared to subcutaneous adipose from lean individuals.



Comparison                Expression Fold Change    t test
Obese (8)/Lean (8)        0.49                      0.014



expression/ mean lean expression. Numbers in parentheses indicates the number of patient 
samples analyzed by real-time PCR. 

[381] Probe set 218434 detects FLJ12389 nucleic acid sequences. Expression 
of FLJ12389 transcripts was increased in adipose tissues compared to all other human adult 
tissues in the gene profiling experiment. 
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indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in 
comparison to all other human adult tissues profiled. 
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[382] FLJ12389 contains the following protein domains (designated with
reference to SEQ ID NO:100): AMP-binding enzyme (PF00501) at amino acids 130 to 571.
FLJ12389 has some sequence similarity to acetyl coenzyme A synthetases and is predicted to
contain ATP/GTP and AMP binding sites. FLJ12389 may be a ketone body-utilizing enzyme
whose physiological role remains unclear (Ohgami, M. et al., Biochem Pharmacol
65: 989-994 (2003)).

FZD4 

[383] Probe set 218665 detects FZD4 nucleic acid sequences. Expression of
FZD4 transcripts was decreased in diabetic compared to lean patients in the gene profiling 
experiment. 
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B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of diabetics in comparison to lean patients. 

[384] FZD4 was also evaluated using real-time PCR. The results further
show that FZD4 is significantly under-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                Expression Fold Change    t test
Diabetic (10)/Lean (8)    0.82                      0.016



"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic 
expression/ mean lean expression. Numbers in parentheses indicates the number of patient 
samples analyzed by real-time PCR. 
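
The real-time PCR legends describe the fold change as a ratio of group means and report a t test. The sketch below illustrates that arithmetic under stated assumptions: the per-sample values are hypothetical (the document reports only summary values), and a pooled-variance two-sample Student's t statistic is assumed because the document does not say which t test variant was used.

```python
from math import sqrt
from statistics import mean, variance


def fold_change(case, control):
    """Ratio of mean case expression to mean control expression."""
    return mean(case) / mean(control)


def pooled_t_statistic(case, control):
    """Two-sample Student's t statistic with pooled variance (an assumption)."""
    n1, n2 = len(case), len(control)
    pooled_var = ((n1 - 1) * variance(case) + (n2 - 1) * variance(control)) / (n1 + n2 - 2)
    return (mean(case) - mean(control)) / sqrt(pooled_var * (1 / n1 + 1 / n2))


# Hypothetical relative expression values, for illustration only.
diabetic = [0.8, 0.9, 0.7, 0.8]
lean = [1.0, 1.1, 0.9, 1.0]

fc = fold_change(diabetic, lean)
t_stat = pooled_t_statistic(diabetic, lean)
print(f"Fold change = {fc:.2f}, t = {t_stat:.3f}")
```

A fold change below 1 with a negative t statistic corresponds to under-expression in the case group relative to the control group.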

[385] Probe set 218665 detects FZD4 nucleic acid sequences. Expression of
FZD4 transcripts was decreased in patients with insulin resistance compared to normal 
patients in the gene profiling experiment. 
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between glucose disposal rate (Rd) and signal intensities. A positive co-efficient indicates 
down regulation whereas a negative co-efficient indicates up regulation of the gene with 
increasing insulin resistance; "n" indicates number of patient samples. 
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[386] FZD4 was also evaluated using real-time PCR. The results further
show that FZD4 is significantly decreased in patients with insulin resistance compared to 
normal patients. 



Comparison                Expression Fold Change    t test
Correlation to Rd (18)    0.651                     <0.005

"Corr Co-efficient" indicates the relationship between Rd and signal intensities. A positive
co-efficient indicates down regulation whereas a negative co-efficient indicates up regulation
of the gene with increasing insulin resistance. Numbers in parentheses indicate the number
of patient samples analyzed by real-time PCR.



[387] Probe set 218665 detects FZD4 nucleic acid sequences. Expression of
FZD4 transcripts was increased in adipose tissues compared to all other human adult tissues
in the gene profiling experiment.
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"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n"
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in 
comparison to all other human adult tissues profiled. 



[388] FZD4 contains the following protein domains (designated with
reference to SEQ ID NO:110): Signal peptide at amino acids 1 to 36; Frizzled/Smoothened
family membrane region (PF01534) at amino acids 209 to 514; Fz domain (PF01392) at
amino acids 35 to 159; and 7 transmembrane domains (TMHMM2.0) at amino acids 10 to 32,
221 to 243, 253 to 275, 301 to 323, 394 to 416, 437 to 459, 474 to 496. FZD4 encodes a
7-transmembrane domain protein and is a receptor for Wnt signaling proteins. The auditory and
cerebellar phenotypes of FZD4 null mice implicate Frizzled signaling in maintaining the
viability and integrity of the nervous system in later life and retinal angiogenesis (Wang, Y.,
et al., J Neurosci. 21: 4761-71 (2001); Singaraja, R.R., et al., Nat Genet 32: 326-30 (2002)).

GLIPR1 

[389] Probe set 226142 detects GLIPR1 nucleic acid sequences. Expression 
of GLIPR1 transcripts was increased in diabetic compared to lean patients in the gene
profiling experiment. 
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"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of diabetics in comparison to lean patients. 

[390] GLIPR1 was also evaluated using real-time PCR. The results further
show that GLIPR1 is significantly over-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                Expression Fold Change    t test
Diabetic (10)/Lean (8)
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"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic 
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR. 

[391] Probe set 226142 detects GLIPR1 nucleic acid sequences. Expression 
of GLIPR1 transcripts was increased in obese compared to lean patients in the gene profiling
experiment. 
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"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of obese in comparison to lean patients. 

[392] GLIPR1 was also evaluated using real-time PCR. The results further
show that GLIPR1 is significantly over-expressed in subcutaneous adipose from obese
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                Expression Fold Change    t test
Obese (5)/Lean (4)        2.93                      0.015



"Fold Change" indicates the fold expression calculated as the ratio of the mean obese 
expression/ mean lean expression. Numbers in parentheses indicates the number of patient 
samples analyzed by real-time PCR. 

[393] Probe set 226142 detects GLIPR1 nucleic acid sequences. Expression
of GLIPR1 transcripts was decreased in pio+insulin compared to insulin treated cultures of
primary human adipocytes in the gene profiling experiment. 
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B/C indicates sample is from Basal or Clamp; "Pre-Pio" and "Post-Pio" indicates sample was 
taken before or after 24 hours of pioglitazone treatment; "Mean Expr" indicates mean 
expression; "SEM" indicates standard error of mean; "n" indicates number of patient 
samples; "Fold Change" indicates fold change of primary human adipocytes post-pio in
comparison to pre-pio samples.

[394] Probe set 226142 detects GLIPR1 nucleic acid sequences. Expression
of GLIPR1 transcripts was decreased in rosi+insulin compared to insulin treated cultures of
primary human adipocytes in the gene profiling experiment.
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B/C indicates sample is from Basal or Clamp; "Pre-Rosi" and "Post-Rosi" indicates sample
was taken before or after 24 hours of rosiglitazone treatment; "Mean Expr" indicates mean
expression; "SEM" indicates standard error of mean; "n" indicates number of patient
samples; "Fold Change" indicates fold change of primary human adipocytes post-rosi in
comparison to pre-rosi samples.

[395] GLIPR1 contains the following protein domains (designated with
reference to SEQ ID NO:118): Signal peptide at amino acids 1 to 21; SCP-like extracellular
protein (PF00188) at amino acids 38 to 174; and 1 transmembrane domain (TMHMM2.0) at
amino acids 235 to 257. GLIPR1 is a putative secreted protein that may play a role in
inhibition of malignant growth and progression through its proapoptotic activities (Ren, C. et
al., Mol Cell Biol 22: 3345-57 (2002)).

GPR105

[396] Probe set 206637 detects GPR105 nucleic acid sequences. Expression
of GPR105 transcripts was increased in diabetic compared to lean patients in the gene 
profiling experiment. 
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GPR105 



B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression; 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of diabetics in comparison to lean patients. 

[397] GPR105 was also evaluated using real-time PCR. The results further
show that GPR105 is significantly over-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                Expression Fold Change    t test
Diabetic (10)/Lean (8)    2                         0.007



"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic 
expression/ mean lean expression. Numbers in parentheses indicates the number of patient 
samples analyzed by real-time PCR. 

[398] GPR105 contains the following protein domains (designated with
reference to SEQ ID NO:124): 7 transmembrane receptor (rhodopsin family) (PF00001) at
amino acids 39 to 295; and 7 transmembrane domains (TMHMM2.0) at amino acids 27 to 49,
56 to 78, 98 to 117, 137 to 159, 187 to 209, 235 to 257, 279 to 298. GPR105 is a G(i/o)
protein-coupled receptor that is activated by extracellular UDP-sugars. Activation of the
receptor stimulates intracellular calcium and may mediate primitive hematopoietic cell
responses to microenvironments (Chambers, J. K. et al., J Biol Chem 275: 10767-71 (2000);
Lee, B. C. et al., Genes Dev 17: 1592-604 (2003); Skelton, L. et al., J Immunol 171: 1941-9
(2003); Moore, D. J. et al., Brain Res Mol Brain Res 118: 10-23 (2003)).

GPR146 

[399] Probe set 228770 detects GPR146 nucleic acid sequences. Expression 
of GPR146 transcripts was decreased in diabetic compared to lean patients in the gene 
profiling experiment. 
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"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.
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[400] GPR146 was also evaluated using real-time PCR. The results further
show that GPR146 is significantly under-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                Expression Fold Change    t test
Diabetic (10)/Lean (8)    0.66                      0.014



expression/ mean lean expression. Numbers in parentheses indicates the number of patient 
samples analyzed by real-time PCR. 

[401] Probe set 228770 detects GPR146 nucleic acid sequences. Expression
of GPR146 transcripts was decreased in patients with insulin resistance compared to normal
patients in the gene profiling experiment.





CORRELATION RD
B/C   n     Corr Co-efficient   Student's t test   Gene Name
C     26    0.679               <0.005             GPR146



between glucose disposal rate (Rd) and signal intensities. A positive co-efficient indicates 
down regulation whereas a negative co-efficient indicates up regulation of the gene with 
increasing insulin resistance; "n" indicates number of patient samples. 
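
The sign convention in the legend above follows from how glucose disposal rate (Rd) relates to insulin resistance: Rd falls as insulin resistance rises, so a gene whose signal rises and falls with Rd is down-regulated as insulin resistance increases. The sketch below illustrates this with a Pearson correlation. The Rd and signal values are hypothetical, and Pearson correlation is an assumption, since the document does not name the correlation method.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def regulation_with_insulin_resistance(r):
    """Apply the legend's convention: positive r means down regulation."""
    return "down regulation" if r > 0 else "up regulation"


# Hypothetical values, for illustration only (not data from this document).
rd_values = [4.0, 6.0, 8.0, 10.0, 12.0]
signal_intensities = [110.0, 150.0, 140.0, 210.0, 230.0]

r = pearson_r(rd_values, signal_intensities)
print(f"r = {r:.3f}: {regulation_with_insulin_resistance(r)} with increasing insulin resistance")
```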

[402] GPR146 was also evaluated using real-time PCR. The results further
show that GPR146 is significantly decreased in patients with insulin resistance compared to 
normal patients. 



Comparison                Expression Fold Change    t test
Correlation to Rd (26)    0.535                     <0.005



co-efficient indicates down regulation whereas a negative co-efficient indicates up regulation 
of the gene with increasing insulin resistance. Numbers in parentheses indicates the number 
of patient samples analyzed by real-time PCR. 

[403] The cellular level of GPR146 was reduced in 3T3-L1 adipocytes using siRNA
directed against GPR146 and the effect on basal and insulin stimulated glucose transport was
determined.






WO 2006/007400 



PCT/US2005/021297 



GPR146 mRNA Level in 3T3-L1 Adipocytes Transfected with siRNA Oligonucleotides

                  Expt 1    Expt 2    Expt 3    Expt 4    Mean FC ± SEM (n=4)
FC (siRNA/Scr)    0.545     0.443     0.399     0.524     0.478 ± 0.03



Legend: "siRNA" indicates Dharmacon Smartpool siRNA oligonucleotides directed against
murine GPR146. "Scr" indicates the Dharmacon Scramble siRNA Control. "FC" indicates the
fold change defined as the following ratio: Level of GPR146 mRNA in GPR146 siRNA
transfected 3T3-L1 adipocytes/Level of GPR146 mRNA in Scramble siRNA transfected
3T3-L1 adipocytes. "n" is the number of experiments. SEM is the standard error of the mean.
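
The summary column of the knockdown table can be reproduced from the four per-experiment fold changes. The following sketch uses only values reported above and assumes the SEM was computed as the sample standard deviation (n-1 denominator) divided by the square root of n, which the document does not state explicitly.

```python
from math import sqrt
from statistics import mean, stdev

# Fold-change values (siRNA/Scr) for Expt 1 to Expt 4, copied from the table above.
fold_changes = [0.545, 0.443, 0.399, 0.524]

n = len(fold_changes)
mean_fc = mean(fold_changes)
# SEM = sample standard deviation / sqrt(n); the n-1 denominator is an assumption.
sem_fc = stdev(fold_changes) / sqrt(n)

print(f"Mean FC = {mean_fc:.3f} +/- {sem_fc:.2f} (n={n})")
```

Under that assumption the result rounds to 0.478 ± 0.03, which agrees with the reported value and corresponds to roughly a 52% reduction in GPR146 mRNA relative to the scramble control.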



Glucose Transport in 3T3-L1 Adipocytes Transfected with siRNA Oligonucleotides 
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against murine GPR146. "Scr" indicates Dharmacon Scramble siRNA Control 
oligonucleotides. "FC" indicates the fold change defined as the following ratio; glucose 
transport in GPR146 siRNA transfected cells/glucose transport in Scramble siRNA 
transfected cells. "n" is the number of experiments. SEM is the standard error of the mean.

[404] The results show that decreasing the levels of GPR146 in a cell such as
an adipocyte leads to a corresponding decrease in glucose uptake. This indicates that 
increasing the levels of GPR146 in tissues of insulin resistant patients or diabetic patients will 
increase the ability of such tissues to take up glucose and hence, will provide an effective 
treatment for insulin resistance and diabetes. 

[405] GPR146 contains the following protein domains (designated with
reference to SEQ ID NO:130): Signal peptide at amino acids 1 to 37; and a 7 transmembrane
receptor (rhodopsin family) (PF00001) at amino acids 44 to 296. GPR146 is a member of the
rhodopsin family of G protein-coupled receptors (GPCR) with very low sequence similarity
to GPR30.
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[406] GPR146 is an orphan GPCR with unknown coupling to G proteins
(Gloriam DE et al., Biochim Biophys Acta. 1722:235-46 (2005)). Agonists of GPR146 can be
detected, e.g., using cells over-expressing GPR146 along with either a promiscuous G protein
(such as Gα16, Kostenis E. Trends Pharmacol Sci. 22:560-4 (2001)) or a chimeric G protein
(such as Gα16z or Gα16s) (see, e.g., Liu AM et al., J Biomol Screen. 8:39-49 (2003); Hazari
et al., Cell Signal. 16:51-62 (2004)). Screening of such cells will detect GPR146 agonists,
for example either by measuring intracellular Ca2+ with a Ca2+ sensitive dye such as
Calcium 3 or by measuring intracellular cyclic AMP.

GPR30 

[407] Probe set 210640 detects GPR30 nucleic acid sequences. Expression of
GPR30 transcripts was decreased in diabetic compared to lean patients in the gene profiling 
experiment. 
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"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients. 

[408] GPR30 was also evaluated using real-time PCR. The results further
show that GPR30 is significantly under-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison 


Expression Fold Change 


t test 


Diabetic (10) /Lean (8) 


0.79 1 


0.005 






"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/ mean lean expression. Numbers in parentheses indicates the number of patient 
samples analyzed by real-time PCR. 

[409] Probe set 210640 detects GPR30 nucleic acid sequences. Expression of 
GPR30 transcripts was decreased in patients with insulin resistance compared to normal 
patients in the gene profiling experiment.





CORRELATION RD
B/C   n     Corr Co-efficient   Student's t test   Gene Name
C     26    0.608               <0.005             GPR30



between glucose disposal rate (Rd) and signal intensities. A positive co-efficient indicates 
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down regulation whereas a negative co-efficient indicates up regulation of the gene with 
increasing insulin resistance; "n" indicates number of patient samples. 

[410] GPR30 was also evaluated using real-time PCR. The results further
show that GPR30 is significantly decreased in patients with insulin resistance compared to
normal patients.



Comparison                Expression Fold Change    t test
Correlation to Rd (26)    0.5                       <0.01



"Corr Co-efficient" indicates the relationship between Rd and signal intensities. A positive 
co-efficient indicates down regulation whereas a negative co-efficient indicates up regulation 
of the gene with increasing insulin resistance. Numbers in parentheses indicates the number 
of patient samples analyzed by real-time PCR.



[411] Probe set 210640 detects GPR30 nucleic acid sequences. Expression of
GPR30 transcripts was increased in adipose tissues compared to all other human adult tissues
in the gene profiling experiment.
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GPR30

"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n"
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled.

[412] GPR30 contains the following protein domains (designated with
reference to SEQ ID NO:136): 7 transmembrane receptor (rhodopsin family) (PF00001) at
amino acids 76 to 324; and 7 transmembrane domains (TMHMM2.0) at amino acids 63 to 85,
97 to 119, 134 to 153, 174 to 196, 219 to 241, 262 to 284, 304 to 326. GPR30 is a G
protein-coupled receptor that stimulates adenylyl cyclase and mediates attenuation of Erk-1/-2
activity by estrogen via Raf-1 inactivation. GPR30 is a progestin target gene whose
expression correlates with progestin-induced growth inhibition in breast cancer cells (Filardo,
E. J. et al., Mol Endocrinol 14: 1649-60 (2000); Ahola, T. M. et al., Endocrinology 143:
4620-6 (2002)).

GPR65

[413] Probe set 214467 detects GPR65 nucleic acid sequences. Expression of
GPR65 transcripts was increased in obese compared to lean patients in the gene profiling 
experiment. 






      OBESE                   LEAN
B/C   Mean Expr   SEM    n    Mean Expr   SEM    n    Fold Change   Student's t test   Gene Name
B     11.44       0.48   5    7.23        0.95   4    1.58          0.013              GPR65



B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.
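
The fold change reported for GPR65 in the gene profiling table can be recovered directly from the two group means. This short sketch uses only the mean expression values given above; the variable names are illustrative.

```python
# Mean expression values copied from the GPR65 gene profiling table above.
mean_expr_obese = 11.44
mean_expr_lean = 7.23

# Fold change of obese in comparison to lean patients.
fold_change = mean_expr_obese / mean_expr_lean

print(f"Fold change (obese/lean) = {fold_change:.2f}")
```

The ratio rounds to 1.58, matching the tabulated fold change.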



[414] GPR65 was also evaluated using real-time PCR. The results further
show that GPR65 is significantly over-expressed in subcutaneous adipose from obese
individuals when compared to subcutaneous adipose from lean individuals.



Comparison                Expression Fold Change    t test
Obese (5)/Lean (4)        4.79                      0.012

"Fold Change" indicates the fold expression calculated as the ratio of the mean obese
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[415] GPR65 contains the following protein domains (designated with
reference to SEQ ID NO:142): 7 transmembrane receptor (rhodopsin family) (PF00001) at
amino acids 31 to 290; and 7 transmembrane domains (TMHMM2.0) at amino acids 15 to 37,
49 to 71, 91 to 110, 130 to 152, 181 to 203, 224 to 246, 271 to 293. GPR65 is a G
protein-coupled receptor activated by the glycosphingolipid psychosine. It may function in apoptosis
and immunological autotolerance, and plays a role in T-cell associated diseases and Globoid
cell leukodystrophy (Kyaw, H. et al., DNA Cell Biol 17: 493-500 (1998); Im, D. S. et al., J
Cell Biol 153: 429-34 (2001)). GPR65 has been described to be expressed in human primary
monocytes and macrophages (Duong, C.Q., et al., Biochim Biophys Acta. 1682: 112-9 (2004)).

HTR2B 

[416] Probe set 206638 detects HTR2B nucleic acid sequences. Expression
of HTR2B transcripts was decreased in pio compared to vehicle treated cultures of primary
human adipocytes in the gene profiling experiment.
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B/C indicates sample is from Basal or Clamp; "Pre-Pio" and "Post-Pio" indicates sample was
taken before or after 24 hours of pioglitazone treatment; "Mean Expr" indicates mean
expression; "SEM" indicates standard error of mean; "n" indicates number of patient
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samples; "Fold Change" indicates fold change of primary human adipocytes post-pio in 
comparison to pre-pio samples. 

[417] HTR2B was also evaluated using real-time PCR. The results further
show that HTR2B is significantly under-expressed in primary cultured human adipocytes
treated with pio when compared to vehicle.



Comparison                     Expression Fold Change    t test
Post-Pio (12)/Pre-Pio (12)     0.63                      0.002



"Fold Change" indicates the fold expression calculated as the ratio of the mean pio
expression/ mean vehicle expression. Numbers in parentheses indicates the number of 
primary human adipocyte samples analyzed by real-time PCR. 
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[418] HTR2B was over-expressed in 3T3-L1 adipocytes and the cells were 
then treated with 5-HT and the effect on basal and insulin stimulated glucose transport was 
determined. 

Glucose Transport Analysis in 3T3-L1 Adipocytes: 
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Legend: "Con" indicates control 3T3-L1 adipocytes that do not express hHTR2B. "FC" 
indicates the fold change defined as the following ratio: glucose transport in HTR2B-
expressing cells stimulated with 1 uM 5HT for 3 hours/glucose transport in non-HTR2B-
expressing cells stimulated with 1 uM 5HT for 3 hours. "h" is human. "n" is the number of
experiments. SEM is the standard error of the mean.



[419] These results show that in a cell such as an adipocyte, increasing the
levels of HTR2B in the presence of its ligand 5-HT leads to a corresponding increase in
glucose uptake. This indicates that increasing the levels or activity of HTR2B in tissues of
insulin resistant patients or diabetic patients will increase the ability of such tissues to take up
glucose and hence, will provide an effective treatment for insulin resistance and diabetes.

[420] HTR2B contains the following protein domains (designated with
reference to SEQ ID NO:148): 7 transmembrane receptor (rhodopsin family) (PF00001) at
amino acids 71 to 380; and 7 transmembrane domains (TMHMM2.0) at amino acids 59 to 81,
93 to 115, 130 to 149, 170 to 192, 217 to 239, 327 to 349, 364 to 383. HTR2B is a 5-
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hydroxytryptamine 2B (serotonin) receptor that signals through phospholipase C and is 
known to induce mitogenesis, mediate contractile effects of serotonin on GI tract smooth 
muscle and may play a role in digestion and migraine headaches (Duxon M.S. et al.,
Neuroscience 76(2): 323-9 (1997); Jerman J.C. et al., Eur J Pharmacol., 414(1): 23-30
(2001); Launay, J.M., et al., J Biol Chem. 271: 3141-7 (1996); Nebigil, C.G. et al., Proc Natl
Acad Sci USA. 97: 2591-6 (2000); Schaerlinger, B., et al., Br J Pharmacol. 140: 277-84.
Epub (2003)).

[421] The HTR2B receptor is a G protein coupled receptor linked to the
mobilization of intracellular Ca2+ (see, e.g., Schmuck K et al., FEBS Lett. 342:85-90 (1994)).
Agonists of the HTR2B receptor can therefore be identified, e.g., by screening cells with high
levels of the HTR2B receptor to identify compounds that increase intracellular Ca2+ using a
calcium sensitive dye such as Calcium 3 or Fluo 3.
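
The screening idea described above, flagging compounds that raise intracellular Ca2+ in cells with high receptor levels, can be sketched as a simple fold-over-baseline comparison of dye fluorescence. Everything in this sketch is hypothetical and for illustration only: the compound names, the fluorescence readings, and the 2-fold hit threshold are assumptions and are not taken from the document.

```python
# Hypothetical dye fluorescence readings (arbitrary units) before and after
# compound addition. Names, values and threshold are illustrative assumptions.
baseline_signal = {"cmpd_A": 100.0, "cmpd_B": 98.0, "vehicle": 101.0}
peak_signal = {"cmpd_A": 310.0, "cmpd_B": 105.0, "vehicle": 103.0}
HIT_THRESHOLD = 2.0  # assumed minimum fold increase over baseline


def call_hits(baseline, peak, threshold):
    """Return the sorted names whose peak/baseline ratio meets the threshold."""
    return sorted(
        name for name in baseline
        if peak[name] / baseline[name] >= threshold
    )


hits = call_hits(baseline_signal, peak_signal, HIT_THRESHOLD)
print("Candidate agonists:", hits)
```

In practice a threshold would be chosen from control wells, and candidate hits would be confirmed in cells that do not express the receptor.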






ITGB2 

[422] Probe set 202803 detects ITGB2 nucleic acid sequences. Expression of 
ITGB2 transcripts was increased in diabetic compared to lean patients in the gene profiling
experiment. 
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"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of diabetics in comparison to lean patients. 

[423] ITGB2 was also evaluated using real-time PCR. The results further 
show that ITGB2 is significantly over-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                Expression Fold Change    t test
Diabetic (10)/Lean (8)    2.41                      0.02



expression/ mean lean expression. Numbers in parentheses indicates the number of patient 
samples analyzed by real-time PCR. 

[424] Probe set 202803 detects ITGB2 nucleic acid sequences. Expression of
ITGB2 transcripts was increased in obese compared to lean patients in the gene profiling 
experiment. 
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B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.

[425] ITGB2 was also evaluated using real-time PCR. The results further
show that ITGB2 is significantly over-expressed in subcutaneous adipose from obese 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                Expression Fold Change    t test
Obese (5)/Lean (4)        5.65                      0.05



"Fold Change" indicates the fold expression calculated as the ratio of the mean obese 
expression/ mean lean expression. Numbers in parentheses indicates the number of patient 
samples analyzed by real-time PCR. 

[426] ICAM-1, the ligand for ITGB2-containing integrin complexes such as
LFA-1, was added to cultures of 3T3-L1 adipocytes and the effect on glucose transport and 
Glut 4 translocation was determined. 



Insulin (nM) 
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Legend: "Fold Change" indicates the following ratio: (Mean % of 3T3-L1 adipocytes
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incubated 3 hours with 10 ug/ml ICAM + 200 uM Mn2+ scored positive for cell surface 
Glut4)/(Mean % 3T3-L1 adipocytes incubated 3 hours with 200 uM Mn2+ scored positive for 
cell surface Glut4). "h" is human. "n" is the number of experiments.

[427] Incubating 3T3-L1 adipocytes with 200 uM Mn2+ for 3 hours
enhanced cell surface Glut4 in the absence of insulin. In contrast, including 10 ug/ml ICAM
inhibited the increase observed with Mn2+ alone.
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Glucose Transport Analysis in 3T3-L1 Adipocytes: 
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Legend: "FC" indicates the fold change defined as the following ratio; glucose transport in 
3T3-L1 adipocytes incubated with 10 ug/ml ICAM1 + 200 uM Mn for 3 hours/glucose 
transport in 3T3-L1 adipocytes incubated with 200 uM Mn for 3 hours. "h" is human. "n" is
the number of experiments. SD is the standard deviation.

[428] The results show that increasing the activity of ITGB2-containing
integrins such as LFA-1 in a cell such as an adipocyte leads to a corresponding decrease in
glucose uptake. This indicates that decreasing the levels or activity of ITGB2-containing
integrins such as LFA-1 in tissues of insulin resistant patients or diabetic patients will
increase the ability of such tissues to take up glucose and hence, will provide an effective
treatment for insulin resistance and diabetes.

[429] ITGB2 contains the following protein domains (designated with
reference to SEQ ID NO:154): Signal peptide at amino acids 1 to 22; Plexin repeat
(PF01437) at amino acids 24 to 63; Integrin, beta chain (PF00362) at amino acids 32 to
447; EGF-like domain (PF00008) at amino acids 582 to 612; and 1 transmembrane domain
(TMHMM2.0) at amino acids 701 to 723. The ITGB2 protein product is the integrin beta
chain beta 2. Integrins are integral cell-surface proteins composed of an alpha chain and a
beta chain. A given chain may combine with multiple partners resulting in different integrins.
For example, beta 2 combines with the alpha L chain to form the integrin LFA-1, and
combines with the alpha M chain to form the integrin Mac-1. Integrins are known to
participate in cell adhesion as well as cell-surface mediated signalling. The regulation of
interaction mediated by adhesion molecules may provide new targets for controlling
inflammatory and immune responses (Bunting, M. et al., Curr Opin Hematol. 9: 30-5 (2002);
Tsuji T. et al., Blood 91(4): 1263-71 (1998); Zhang L. and Plow E.F. Biochemistry 38(25):
8064-71 (1999)).

[430] Inhibitors of LFA-1 can be detected using a variety of assays, such as
those that detect the ability of candidate compounds to disrupt the association of LFA-1 with 
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the ligand ICAM-1 . For example, purified LFA-1 can be immobilized and compounds that 
cause the inhibition of biotinylated ICAM-1 binding can be detected. 
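
One way such a binding readout might be scored is as percent inhibition relative to an uninhibited control. The following is a minimal sketch only: the function name, the background-subtraction scheme and the example readings are hypothetical placeholders and are not taken from the present disclosure.

```python
def percent_inhibition(signal, uninhibited, background):
    """Percent inhibition of biotinylated ICAM-1 binding to immobilized LFA-1.

    signal      -- readout in the presence of the candidate compound
    uninhibited -- readout with no compound (maximum binding)
    background  -- readout with no LFA-1 immobilized (non-specific binding)
    All three readings are assumed to be in the same arbitrary units.
    """
    window = uninhibited - background
    if window <= 0:
        raise ValueError("uninhibited signal must exceed background")
    return 100.0 * (1.0 - (signal - background) / window)


# Hypothetical readings, for illustration only
print(round(percent_inhibition(signal=0.55, uninhibited=1.00, background=0.10), 1))
```

A compound would then be flagged as a candidate inhibitor when its percent inhibition exceeds whatever threshold the screen adopts.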

[431] Inhibitors of the LFA-1 integrin complex (alphaL/beta2) have been
developed. These include inhibitors such as BIRT0377, LFA703 and A-286982 (Shimaoka
et al., Nat. Rev. Drug Discovery 2: 703-716 (2003)). Such inhibitors, as well as other inhibitors
of the LFA-1 integrin complex, are useful as agents for the treatment of insulin resistance and
diabetes.



ITIH5 



[432] Probe set MBXHUMFAT04252 detects ITIH5 nucleic acid sequences.
Expression of ITIH5 transcripts was increased in diabetic compared to lean patients in the
gene profiling experiment.



B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression; 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of diabetics in comparison to lean patients. 

[433] ITIH5 was also evaluated using real-time PCR. The results further
show that ITIH5 is significantly over-expressed in subcutaneous adipose from diabetic
individuals when compared to subcutaneous adipose from lean individuals.



Comparison                 Expression Fold Change    ttest
Diabetic (10)/Lean (8)     1.78                      0.001



"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic 
expression/mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[434] Probe set MBXHUMFAT04252 detects ITIH5 nucleic acid sequences.
Expression of ITIH5 transcripts was increased in obese compared to lean patients in the gene
profiling experiment.



B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.
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[435] ITIH5 was also evaluated using real-time PCR. The results further 
show that ITIH5 is significantly over-expressed in subcutaneous adipose from obese 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                 Expression Fold Change    ttest
Obese (8)/Lean (8)         1.43                      0.094



"Fold Change" indicates the fold expression calculated as the ratio of the mean obese 
expression/mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[436] Probe set MBXHUMFAT04252 detects ITIH5 nucleic acid sequences.
Expression of ITIH5 transcripts was increased in adipose tissues compared to all other human
adult tissues in the gene profiling experiment.
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ITIH5 



"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n"
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled.

[437] ITIH5 was over-expressed in 3T3-L1 adipocytes and the effect on
basal and insulin stimulated glucose transport and Glut 4 translocation was determined.

Glucose Transport Analysis in 3T3-L1 Adipocytes:



Legend: "Con" indicates control 3T3-L1 adipocytes that do not express ITIH5. "FC"
indicates the fold change defined as the following ratio: glucose transport in
ITIH5-expressing 3T3-L1 adipocytes/glucose transport in non-ITIH5-expressing 3T3-L1
adipocytes. "h" is human, "n" is the number of experiments. SEM is the standard error of the
mean.






Glut4 Translocation Analysis 



Insulin (nM)    Fold Change (Mean hITIH5/Mean LacZ) (n=3)    ttest (hITIH5 vs LacZ)
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2.43 


0.004
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0.422 "~l 




0.96 


0.170



scored positive for cell surface Glut4)/(Mean % of LacZ-expressing cells scored positive for 
cell surface Glut4). "h" is human, "n" is the number of experiments. 

[438] The results show that increasing the levels of ITIH5 in a cell such as an
adipocyte leads to a corresponding increase in glucose uptake. This indicates that increasing
the levels or activity of ITIH5 in tissues of insulin resistant patients or diabetic patients will
increase the ability of such tissues to take up glucose and hence, will provide an effective
treatment for insulin resistance and diabetes.

[439] ITIH5 contains the following protein domains (designated with
reference to SEQ ID NO:160): Signal peptide at amino acids 1 to 21; Inter-alpha-trypsin
inhibitor heavy chain C-terminus (PF06668) at amino acids 715 to 909; T-box (PF00907) at
amino acids 310 to 426; and von Willebrand factor type A domain (PF00092) at amino acids
295 to 478. A soluble active secreted form of ITIH5 has been detected and this is displayed in
SEQ ID NO:161. ITIH5 belongs to the inter-alpha-trypsin inhibitor (ITI) family, which constitutes a
group of proteins built up from one light chain and a variable set of heavy chains. Originally
identified as plasma protease inhibitors, recent data indicate that ITI proteins play a role in
extracellular matrix (ECM) stabilization and in prevention of tumor metastasis. ITIH5
expression was found to be consistently downregulated in invasive mammary ductal
carcinomas (Himmelfarb M. et al., Cancer Lett 204(1): 69-77 (2004)).
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LGALS12 

[440] Probe set 223828 detects LGALS12 nucleic acid sequences.
Expression of LGALS12 transcripts was decreased in diabetic compared to lean patients in
the gene profiling experiment.
LGALS12

"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.
[441] LGALS12 was also evaluated using real-time PCR. The results further
show that LGALS12 is significantly under-expressed in subcutaneous adipose from diabetic
individuals when compared to subcutaneous adipose from lean individuals.



Comparison                 Expression Fold Change    ttest
Diabetic (10)/Lean (8)     0.31                      <0.000



"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[442] Probe set 223828 detects LGALS12 nucleic acid sequences.
Expression of LGALS12 transcripts was decreased in obese compared to lean patients in the
gene profiling experiment.
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"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of obese in comparison to lean patients. 

[443] LGALS12 was also evaluated using real-time PCR. The results further
show that LGALS12 is significantly under-expressed in subcutaneous adipose from obese
individuals when compared to subcutaneous adipose from lean individuals.



Comparison                 Expression Fold Change    ttest
Obese (8)/Lean (8)         0.49                      0.002



"Fold Change" indicates the fold expression calculated as the ratio of the mean obese
expression/mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[444] Probe set 223828 detects LGALS12 nucleic acid sequences.
Expression of LGALS12 transcripts was decreased in obese compared to lean patients in the
gene profiling experiment.
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.

[445] Probe set 223828 detects LGALS12 nucleic acid sequences.
Expression of LGALS12 transcripts was increased in adipose tissues compared to all other
human adult tissues in the gene profiling experiment.
"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n"
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled.

[446] LGALS12 contains the following protein domains (designated with
reference to SEQ ID NO: 165): Galactoside-binding lectin (PF00337) at amino acids 48 to
182. LGALS12 is a member of the beta-galactoside-binding lectin family and may be an
apoptosis activator that negatively regulates the cell cycle and cell proliferation (Yang R.Y. et
al., J Biol Chem., 276(23): 20252-60 (2001); Hotta K. et al., J Biol Chem., 276(36): 34089-97
(2001); Liu, F.T., et al., Biochim Biophys Acta. 1572: 263-73 (2002)).

NMB 

[447] Probe set 205204 detects NMB nucleic acid sequences. Expression of
NMB transcripts was increased in diabetic compared to lean patients in the gene profiling
experiment.
"Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.

[448] Probe set 205204 detects NMB nucleic acid sequences. Expression of
NMB transcripts was increased in adipose tissues compared to all other human adult tissues
in the gene profiling experiment.
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"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n" 
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled.

[449] NMB was also evaluated using real-time PCR. The results further 
show that NMB is significantly over-expressed in adipose tissues when compared to all other 
human adult tissues. 



Comparison                                Expression Fold Change    ttest
Fat Tissues (5)/All Other Tissues (13)    26.71                     0.029



tissues expression/mean other tissues expression. Numbers in parentheses indicate the
number of human adult tissue samples analyzed by real-time PCR.

[450] NMB contains the following protein domains (designated with
reference to SEQ ID NO: 181): Signal peptide at amino acids 1 to 26; and Bombesin-like
peptide (PF02044) at amino acids 47 to 60. A soluble active secreted form of NMB has been
detected and this is displayed in SEQ ID NO: 182. NMB is a bombesin-related neuropeptide
which induces calcium flux and phosphatidylinositol turnover leading to stimulation of cell
proliferation through the G-protein coupled neuromedin B receptor (Ohki-Hamazaki H. Prog.
Neurobiol. 62(3): 297-312 (2000); Mason S. et al., Eur J Pharmacol, 438(1-2): 25-34
(2002)).
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NNAT 

[451] Probe set 204239 detects NNAT nucleic acid sequences. Expression of
NNAT transcripts was decreased in diabetic compared to lean patients in the gene profiling
experiment.
B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.
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[452] NNAT was also evaluated using real-time PCR. The results further 
show that NNAT is significantly under-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                 Expression Fold Change    ttest
Diabetic (10)/Lean (8)     0.64                      0.14



"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic 



expression/mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[453] Probe set 204239 detects NNAT nucleic acid sequences. Expression of
NNAT transcripts was increased in rosiglitazone (rosi) treated compared to vehicle treated
cultures of primary human adipocytes in the gene profiling experiment.
B/C indicates sample is from Basal or Clamp; "Pre-Rosi" and "Post-Rosi" indicate that the sample
was taken before or after 24 hours of rosiglitazone treatment; "Mean Expr" indicates mean
expression; "SEM" indicates standard error of mean; "n" indicates number of patient
samples; "Fold Change" indicates fold change of primary human adipocytes post-rosi in
comparison to pre-rosi samples.



[454] NNAT was also evaluated using real-time PCR. The results further
show that NNAT is significantly over-expressed in primary cultured human adipocytes
treated with rosi when compared to vehicle.



Comparison                       Expression Fold Change    ttest
Post-Rosi (12)/Pre-Rosi (12)     2.13                      0.004



"Fold Change" indicates the fold expression calculated as the ratio of the mean rosi 
expression/mean vehicle expression. Numbers in parentheses indicate the number of
primary human adipocyte samples analyzed by real-time PCR.

[455] Probe set 204239 detects NNAT nucleic acid sequences. Expression of 
NNAT transcripts was increased in adipose tissues compared to all other human adult tissues 
in the gene profiling experiment. 
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indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in 
comparison to all other human adult tissues profiled. 

[456] NNAT contains the following protein domains (designated with
reference to SEQ ID NO: 188): Signal peptide at amino acids 1 to 23; and 1 transmembrane
domain (TMHMM2.0) at amino acids 13 to 35. NNAT is a putative proteolipid that may
regulate ion channels during brain development. It is found to be highly expressed in pituitary
adenomas and is frequently hypermethylated in childhood myeloid and lymphoid acute
leukemias (Dou D. and Joseph R. Genomics 33(2): 292-7 (1996); Usui H. et al., J Mol
Neurosci. 9(1): 55-60 (1997); Evans EK. et al., Genomics 77(1-2): 99-104 (2001)).

OLFM2 

[457] Probe set 223601 detects OLFM2 nucleic acid sequences. Expression 
of OLFM2 transcripts was decreased in diabetic compared to lean patients in the gene 
profiling experiment. 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.

[458] OLFM2 was also evaluated using real-time PCR. The results further 
show that OLFM2 is significantly under-expressed in subcutaneous adipose from diabetic 
individuals when compared to subcutaneous adipose from lean individuals. 



Comparison                 Expression Fold Change    ttest
Diabetic (10)/Lean (8)     0.34                      0.029



"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[459] Probe set 223601 detects OLFM2 nucleic acid sequences. Expression
of OLFM2 transcripts was increased in adipose tissues compared to all other human adult
tissues in the gene profiling experiment.
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OLFM2 



"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n"
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled.

[460] OLFM2 was over-expressed in 3T3-L1 adipocytes and the effect on
basal and insulin stimulated glucose transport was determined.

Glucose Transport Analysis in 3T3-L1 Adipocytes:
Legend: "Con" indicates control 3T3-L1 adipocytes that do not express OLFM2. "FC"
indicates the fold change defined as the following ratio: glucose transport in
OLFM2-expressing 3T3-L1 adipocytes/glucose transport in non-OLFM2-expressing 3T3-L1
adipocytes. "h" is human, "n" is the number of experiments. SEM is the standard error of the
mean.



[461] The results show that increasing the levels of OLFM2 in a cell such as
an adipocyte leads to a corresponding decrease in glucose uptake. This indicates that
decreasing the levels or activity of OLFM2 in tissues of insulin resistant patients or diabetic
patients will increase the ability of such tissues to take up glucose and hence, will provide an
effective treatment for insulin resistance and diabetes.

[462] OLFM2 contains the following protein domains (designated with
reference to SEQ ID NO: 196): Signal peptide at amino acids 1 to 24; and Olfactomedin-like
domain (PF02191) at amino acids 196 to 446. A soluble active secreted form of OLFM2 has
been predicted and this is displayed in SEQ ID NO: 197. OLFM2 is a protein which possesses
high similarity to olfactomedin 3 (rat Olfm3), which is known to interact with myocilin and is
associated with glaucoma and disorders involving the anterior segment of the eye and the
retina (Ortego J. et al., FEBS Lett. 413(2): 349-53 (1997)).




OPN3 

[463] Probe set 219032 detects OPN3 nucleic acid sequences. Expression of
OPN3 transcripts was increased in obese compared to lean patients in the gene profiling
experiment.
B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.



[464] OPN3 was also evaluated using real-time PCR. The results further
show that OPN3 is significantly over-expressed in subcutaneous adipose from obese
individuals when compared to subcutaneous adipose from lean individuals.



Comparison                 Expression Fold Change    ttest
Obese (5)/Lean (4)         1.74                      0.028



"Fold Change" indicates the fold expression calculated as the ratio of the mean obese 
expression/mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.



[465] OPN3 contains the following protein domains (designated with
reference to SEQ ID NO:203): 7 transmembrane receptor (rhodopsin family) (PF00001) at
amino acids 58 to 309; and 7 transmembrane domains (TMHMM2.0) at amino acids 44 to 66,
78 to 100, 115 to 137, 158 to 180, 200 to 222, 258 to 280, 290 to 312. OPN3 is a member of
the opsin receptor cluster of G protein-coupled receptors which may play a role in non-visual
photic processes such as the entrainment of circadian rhythm or the regulation of
pineal melatonin production (Blackshaw S., and Snyder S.H., J Neurosci. 19(10): 3681-90
(1999); Halford S. et al., Genomics 72(2): 203-8 (2001)).

PTPRE 

[466] Probe set 221840 detects PTPRE nucleic acid sequences. Expression of
PTPRE transcripts was increased in diabetic compared to lean patients in the gene profiling
experiment.
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.

[467] PTPRE was also evaluated using real-time PCR. The results further
show that PTPRE is significantly over-expressed in subcutaneous adipose from diabetic
individuals when compared to subcutaneous adipose from lean individuals.



Comparison                 Expression Fold Change    ttest
Diabetic (10)/Lean (8)     1.8                       0.001



"Fold Change" indicates the fold expression calculated as the ratio of
the mean diabetic expression/mean lean expression. Numbers in parentheses indicate the
number of patient samples analyzed by real-time PCR.

[469] PTPRE was over-expressed in 3T3-L1 adipocytes and the effect on
basal and insulin stimulated glucose transport and Glut 4 translocation was determined.



Insulin (nM)    Fold Change (Mean hPTPRE/Mean LacZ) (n=3)    ttest (hPTPRE vs LacZ)
0               0.24                                         0.001
0.5             0.48                                         0.01
10              0.42                                         0.001



L1 adipocytes that were scored positive for cell surface Glut4)/(Mean % of LacZ-expressing
3T3-L1 adipocytes that were scored positive for cell surface Glut4). "h" is human. "n" is the
number of experiments.

[470] Increasing the levels of hPTPRE in 3T3-L1 adipocytes significantly 
inhibits basal and insulin-stimulated Glut4 translocation to the cell surface. 
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Glucose Transport Analysis in 3T3-L1 Adipocytes: 





              Experiment 1            Experiment 2            Experiment 3
Insulin   FC             ttest    FC             ttest    FC             ttest    Mean FC
(nM)      (hPTPRE/Con)   (a)      (hPTPRE/Con)   (a)      (hPTPRE/Con)   (a)      +/- SEM (n=3)
0         1.06           0.747    0.85           0.554    0.84           0.021    0.92±0.07
0.05      0.80           0.095    1.23           0.163    0.66           0.046    0.90±0.17
0.1       0.67           0.208    0.73           0.041    0.58           0.048    0.66±0.04
0.3       0.82           0.115    0.73           0.027    0.59           0.040    0.71±0.07
1         0.94           0.340    0.76           0.006    0.65           0.018    0.78±0.08
10        0.95           0.192    0.83           0.067    0.72           0.008    0.83±0.07



Legend: "Con" indicates control 3T3-L1 adipocytes that do not express hPTPRE. "FC"
indicates the fold change defined as the following ratio: glucose transport in
hPTPRE-expressing 3T3-L1 adipocytes/glucose transport in non-PTPRE-expressing 3T3-L1
adipocytes. "h" is human, "n" is the number of experiments. SEM is the standard error of the
mean.
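
The "Mean FC +/- SEM (n=3)" column can be reproduced from the three per-experiment fold changes. The sketch below is illustrative only: it assumes SEM is the sample standard deviation divided by the square root of n, and the function name is hypothetical. The per-experiment values are those reported for hPTPRE in the glucose transport table.

```python
import math
from statistics import mean, stdev


def mean_fc_and_sem(fold_changes):
    """Return (mean, SEM) of per-experiment fold changes.

    Assumption: SEM = sample standard deviation / sqrt(n).
    """
    n = len(fold_changes)
    return mean(fold_changes), stdev(fold_changes) / math.sqrt(n)


# Insulin (nM) -> fold change (hPTPRE/Con) in Experiments 1, 2 and 3
ptpre_fc = {
    0: [1.06, 0.85, 0.84],
    0.05: [0.80, 1.23, 0.66],
    0.1: [0.67, 0.73, 0.58],
    0.3: [0.82, 0.73, 0.59],
    1: [0.94, 0.76, 0.65],
    10: [0.95, 0.83, 0.72],
}

for insulin_nm, values in ptpre_fc.items():
    m, sem = mean_fc_and_sem(values)
    print(f"{insulin_nm:>5} nM insulin: {m:.2f} +/- {sem:.2f}")
```

Under this assumption the recomputed values agree with the tabulated column to two decimal places.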



[471] The results show that increasing the levels of PTPRE in a cell such as
an adipocyte leads to a corresponding decrease in glucose uptake. This indicates that
decreasing the levels or activity of PTPRE in tissues of insulin resistant patients or diabetic
patients will increase the ability of such tissues to take up glucose and hence, will provide an
effective treatment for insulin resistance and diabetes.

[472] PTPRE contains the following protein domains (designated with
reference to SEQ ID NO:213): Signal peptide at amino acids 1 to 19; Protein-tyrosine
phosphatase (PF00102) at amino acids 159 to 393, 451 to 688; and 1 transmembrane domain
(TMHMM2.0) at amino acids 47 to 69. PTPRE is found to exist in both a soluble,
cytoplasmic form and a transmembrane form. PTPRE is induced by IL1 and TNFA treatment in
astrocytoma cells, suggesting a role in the inflammatory response of the brain (Schumann, G.
et al., Brain Res Mol Brain Res. 62: 56-64 (1988)). Other studies suggest a role for PTPRE in
RAS related signal transduction pathways, cytokine-induced signaling, activation of
voltage-gated K+ channels and vascular development and angiogenesis (Tiran, Z.J. et al., J.
Biol Chem. 278: 17509-14 (2003); Toledano-Katchalski, H., et al., Mol Cancer Res. 1: 541-50
(2003); Thompson, L.J., et al., Am J Physiol Heart Circ Physiol. 281: H396-403 (2001)).

[473] PTPRE can dephosphorylate the tyrosine phosphorylated insulin
receptor (see, e.g., Nakagawa Y. et al., Zoolog Sci. 22: 169-75 (2005)). Thus, an exemplary
assay for inhibitors of PTPRE comprises determining the ability of candidate compounds to
inhibit the ability of purified PTPRE to dephosphorylate a tyrosine phosphorylated peptide,
e.g., a tyrosine phosphorylated peptide derived from the sequence of the insulin receptor and
containing autophosphorylation sites such as Tyrosines 1158, 1162 and 1163.

RDC1 

[474] Probe set 212977 detects RDC1 nucleic acid sequences. Expression of
RDC1 transcripts was decreased in diabetic compared to lean patients in the gene profiling
experiment.





       DIABETIC                   LEAN
B/C    Mean Expr   SEM    n      Mean Expr   SEM     n     Fold Change   Student's t test   Gene Name
C      169.88      9.84   10     238.53      13.88   8     0.71          0.001              RDC1



B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression; 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of diabetics in comparison to lean patients. 
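
As a rough consistency check, the fold change and an approximate Student's t statistic can be recomputed from the tabulated summary statistics alone. The sketch below is illustrative: it assumes an unpaired two-sample test with pooled variance (the text states only that a Student's t test was used), and the function names are hypothetical.

```python
import math


def fold_change(mean_a, mean_b):
    """Ratio of the mean expression in group A to that in group B."""
    return mean_a / mean_b


def pooled_t_from_summary(mean_a, sem_a, n_a, mean_b, sem_b, n_b):
    """Student's t statistic and degrees of freedom from mean, SEM and n.

    Assumption: unpaired, equal-variance (pooled) test.
    """
    var_a = (sem_a ** 2) * n_a  # SEM = SD / sqrt(n), so SD^2 = SEM^2 * n
    var_b = (sem_b ** 2) * n_b
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / df
    se_diff = math.sqrt(pooled_var * (1.0 / n_a + 1.0 / n_b))
    return (mean_a - mean_b) / se_diff, df


# RDC1, clamp samples: diabetic (n=10) versus lean (n=8)
fc = fold_change(169.88, 238.53)
t_stat, df = pooled_t_from_summary(169.88, 9.84, 10, 238.53, 13.88, 8)
print(f"fold change = {fc:.2f}, t = {t_stat:.2f}, df = {df}")
```

The sign of t is negative here because expression is lower in the diabetic group; a p-value would be obtained by referring the statistic to a t distribution with the stated degrees of freedom.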

[475] RDC1 was also evaluated using real-time PCR. The results further
show that RDC1 is significantly under-expressed in subcutaneous adipose from diabetic
individuals when compared to subcutaneous adipose from lean individuals.



Comparison                 Expression Fold Change    ttest
Diabetic (10)/Lean (8)     0.65                      0.008



"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[476] Probe set 212977 detects RDC1 nucleic acid sequences. Expression of
RDC1 transcripts was decreased in patients with insulin resistance compared to normal
patients in the gene profiling experiment.





       CORRELATION RD
B/C    n     Corr Co-efficient    Student's t test    Gene Name
C      26    0.649                <0.005              RDC1



B/C indicates sample is from Basal or Clamp; "Corr Co-efficient" indicates the relationship 
between glucose disposal rate (Rd) and signal intensities. A positive co-efficient indicates 
down regulation whereas a negative co-efficient indicates up regulation of the gene with 
increasing insulin resistance; "n" indicates number of patient samples. 
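
For a correlation coefficient, a t statistic is conventionally obtained as t = r * sqrt((n - 2) / (1 - r^2)) with n - 2 degrees of freedom. The sketch below assumes that this standard test of r against zero underlies the tabulated t test (the text does not say so explicitly) and that the coefficient is a Pearson coefficient; the function names are hypothetical.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def t_from_r(r, n):
    """t statistic for testing r against zero, with n - 2 degrees of freedom."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))


# RDC1 signal intensity versus glucose disposal rate (Rd): r = 0.649, n = 26
t_stat = t_from_r(0.649, 26)
print(f"t = {t_stat:.2f} with {26 - 2} degrees of freedom")
```

Given per-patient Rd values and signal intensities, `pearson_r` would supply the coefficient that `t_from_r` then converts to a test statistic.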

[477] RDC1 was also evaluated using real-time PCR. The results further
show that RDC1 is significantly decreased in patients with insulin resistance compared to
normal patients.

Comparison                 Expression Fold Change    ttest
Correlation to Rd (26)     0.567                     <0.005



"Corr Co-efficient" indicates the relationship between Rd and signal intensities. A positive 
co-efficient indicates down regulation whereas a negative co-efficient indicates up regulation 
of the gene with increasing insulin resistance. Numbers in parentheses indicate the number
of patient samples analyzed by real-time PCR.



[478] Probe set 212977 detects RDC1 nucleic acid sequences. Expression of
RDC1 transcripts was increased in obese compared to lean patients in the gene profiling
experiment.





       OBESE                      LEAN
B/C    Mean Expr   SEM     n     Mean Expr   SEM    n     Fold Change   Student's t test   Gene Name
B      237.66      36.98   5     153.85      5.4    4     1.54          0.086              RDC1



B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.

[479] Probe set 212977 detects RDC1 nucleic acid sequences. Expression of
RDC1 transcripts was increased in adipose tissues compared to all other human adult tissues
in the gene profiling experiment.

ADIPOSE                    OTHER
Mean Expr   SEM     n     Mean Expr   SEM    n      Fold Change   Student's t test   Gene Name
261.58      37.67   5     46.15       9.41   13     5.67          0.004              RDC1



"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n" 



indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled.

[480] RDC1 contains the following protein domains (designated with
reference to SEQ ID NO:223): 7 transmembrane receptor (rhodopsin family) (PF00001) at
amino acids 61 to 315; and 7 transmembrane domains (TMHMM2.0) at amino acids 47 to 69,
82 to 104, 119 to 140, 160 to 182, 214 to 236, 255 to 277, 297 to 319. RDC1 is considered to
be a new member of the rhodopsin family of G-protein coupled receptors. The protein is a
co-receptor for human immunodeficiency viruses (HIV). Translocations involving this gene and
HMGA2 on chromosome 12 have been observed in lipomas (Broberg, K., et al., Int J Oncol
21: 321-6 (2002); Shimizu, N., et al., J Virol 74: 619-26 (2000)).
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SLIT2 

[481] Probe set 209897 detects SLIT2 nucleic acid sequences. Expression of
SLIT2 transcripts was increased in obese compared to lean patients in the gene profiling
experiment.

       OBESE                     LEAN
B/C    Mean Expr   SEM    n     Mean Expr   SEM    n     Fold Change   Student's t test   Gene Name
B      81.28       5.02   5     48.53       3.08   4     1.68          0.001              SLIT2



B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.

[482] SLIT2 was also evaluated using real-time PCR. The results further
show that SLIT2 is significantly over-expressed in subcutaneous adipose from obese
individuals when compared to subcutaneous adipose from lean individuals.



Comparison                 Expression Fold Change    ttest
Obese (5)/Lean (4)         3.31                      0.014



"Fold Change" indicates the fold expression calculated as the ratio of the mean obese
expression/mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[483] Cellular levels of SLIT2 were reduced in 3T3-L1 adipocytes using
siRNA directed against SLIT2 and the effect on basal and insulin stimulated glucose
transport was determined.



SLIT2 mRNA Level in 3T3-L1 Adipocytes Transfected with siRNA Oligonucleotides

                   Expt 1    Expt 2    Expt 3    Mean FC ±SEM (n=3)
FC (siRNA/Scr)     0.29      0.22      0.26      0.26 ± 0.02



Legend: "siRNA" indicates Dharmacon Smartpool siRNA oligonucleotides directed against
murine SLIT2. "Scr" indicates the Dharmacon Scramble siRNA Control. "FC" indicates the
fold change defined as the following ratio: Level of SLIT2 mRNA in SLIT2 siRNA
transfected 3T3-L1 adipocytes/Level of SLIT2 mRNA in Scramble siRNA transfected
3T3-L1 adipocytes. "n" is the number of experiments. SEM is the standard error of the mean.
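
The knockdown efficiency implied by these fold changes can be summarized as a percentage reduction in SLIT2 mRNA. The sketch below is illustrative: it assumes SEM is the sample standard deviation divided by the square root of n, defines percent knockdown as (1 - mean FC) x 100, and uses a hypothetical function name. The three fold-change values are those reported for the siRNA/Scr ratio.

```python
import math
from statistics import mean, stdev


def knockdown_summary(fc_values):
    """Return (mean FC, SEM, percent knockdown) for siRNA/Scr mRNA ratios.

    Assumptions: SEM = sample SD / sqrt(n); knockdown = (1 - mean FC) * 100.
    """
    m = mean(fc_values)
    sem = stdev(fc_values) / math.sqrt(len(fc_values))
    return m, sem, (1.0 - m) * 100.0


# SLIT2 mRNA fold change (siRNA/Scr) in Expt 1, Expt 2 and Expt 3
m, sem, kd = knockdown_summary([0.29, 0.22, 0.26])
print(f"mean FC = {m:.2f} +/- {sem:.2f}; knockdown = {kd:.0f}%")
```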

Glucose Transport in 3T3-L1 Adipocytes Transfected with siRNA Oligonucleotides 





              Expt 1                  Expt 2                  Expt 3
Insulin   FC             t-test   FC             t-test   FC             t-test   Mean FC (siRNA/Scr)
(nM)      (siRNA/Scr)             (siRNA/Scr)             (siRNA/Scr)             ±SEM (n=3)
0.0       1.235          0.020    0.948          0.001    0.639          0.600    0.941 ± 0.07
0.1       0.902          0.050    0.694          0.001    0.573          0.020    0.723 ± 0.04
3.0       1.094          0.001    1.029          <0.001   0.709          0.490    0.944 ± 0.04




Legend: "siRNA" indicates Dharmacon Smartpool siRNA oligonucleotides directed
against murine SLIT2. "Scr" indicates Dharmacon Scramble siRNA Control oligonucleotides.
"FC" indicates the fold change defined as the following ratio: glucose transport in SLIT2
siRNA transfected 3T3-L1 adipocytes/glucose transport in Scramble siRNA transfected
3T3-L1 adipocytes. "n" is the number of experiments. SEM is the standard error of the mean.



[484] The results show that decreasing the levels of SLIT2 in a cell such as an
adipocyte leads to a corresponding decrease in glucose uptake. This indicates that increasing
the levels or activity of SLIT2 in tissues of insulin resistant patients or diabetic patients will
increase the ability of such tissues to take up glucose and hence, will provide an effective
treatment for insulin resistance and diabetes.

[485] SLIT2 contains the following protein domains (designated with
reference to SEQ ID NO:229): Signal peptide at amino acids 1 to 30; Leucine rich repeat
C-terminal domain (PF01463) at amino acids 233 to 258, 454 to 479, 688 to 713, 883 to
908; Leucine rich repeat N-terminal domain (PF01462) at amino acids 27 to 54, 272 to 299,
505 to 532, 726 to 753; Leucine Rich Repeat (PF00560) at amino acids 56 to 79, 128 to 151,
176 to 199, 325 to 348, 349 to 372, 559 to 582, 607 to 630, 778 to 801, 802 to 825, 826 to
849; Laminin G domain (PF00054) at amino acids 1188 to 1319; and EGF-like domain
(PF00008) at amino acids 922 to 954, 961 to 995, 1002 to 1033, 1040 to 1073, 1080 to 1111,
1125 to 1156, 1336 to 1367, 1375 to 1406, 1416 to 1447. A soluble active secreted form of
SLIT2 has been detected (Nguyen Ba-Charvet, K.T. et al., J. Neurosci. 21: 4281-4289
(2001)) and this is displayed in SEQ ID NO:230. SLIT2 is the ligand for roundabout receptor
ROBO1. Mammalian SLIT proteins may participate in the formation and maintenance of the
nervous and endocrine systems by protein-protein interactions. SLIT2 has been reported to
be a chemorepellent for neuronal migration in vivo and to induce branching of dorsal root ganglia
axons (Nguyen Ba-Charvet K.T. et al., J Neurosci. 21(12): 4281-9 (2001); Nguyen Ba-Charvet
K.T. et al., J Physiol Paris 96: 91-8 (2002)). SLIT2 has also been found to inhibit leukocyte
chemotaxis induced by chemotactic factors (Wu, J.Y., et al., Nature 410: 948-52 (2001)).
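The domain coordinates listed in paragraph [485] lend themselves to a simple interval representation. The sketch below is a hypothetical helper, not something described in the source: it stores each annotated SLIT2 domain as a (name, start, end) record relative to SEQ ID NO:229 and sorts the records to show the linear order of domains along the protein. The class, function, and variable names are illustrative assumptions, and the extraction-damaged coordinates are read as 128 to 151, 1188 to 1319, 1080 to 1111 and 1125 to 1156.

```python
from typing import NamedTuple


class Domain(NamedTuple):
    name: str
    start: int  # first amino acid (1-based, inclusive)
    end: int    # last amino acid (inclusive)


def build(name, spans):
    """Create one Domain record per (start, end) pair."""
    return [Domain(name, s, e) for s, e in spans]


# Coordinates as listed for SLIT2 (SEQ ID NO:229)
slit2_domains = (
    build("Signal peptide", [(1, 30)])
    + build("LRR C-terminal (PF01463)",
            [(233, 258), (454, 479), (688, 713), (883, 908)])
    + build("LRR N-terminal (PF01462)",
            [(27, 54), (272, 299), (505, 532), (726, 753)])
    + build("LRR (PF00560)",
            [(56, 79), (128, 151), (176, 199), (325, 348), (349, 372),
             (559, 582), (607, 630), (778, 801), (802, 825), (826, 849)])
    + build("Laminin G (PF00054)", [(1188, 1319)])
    + build("EGF-like (PF00008)",
            [(922, 954), (961, 995), (1002, 1033), (1040, 1073), (1080, 1111),
             (1125, 1156), (1336, 1367), (1375, 1406), (1416, 1447)])
)

# Order the domains from N-terminus to C-terminus and print their lengths
ordered = sorted(slit2_domains, key=lambda d: d.start)
for d in ordered:
    print(f"{d.start:>5}-{d.end:<5} {d.name} ({d.end - d.start + 1} aa)")
```

Sorting by start position makes the repeating pattern visible: each LRR N-terminal domain is followed by a run of leucine rich repeats and then an LRR C-terminal domain.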

[486] The human SLIT2 gene has been cloned and the proximal promoter
has been identified (see, e.g., Dallol A. et al., Cancer Res. 62:5874-80 (2002)). Therefore, one
example of a method of screening for SLIT2 regulators is as follows. A SLIT2 promoter
can be inserted upstream of a reporter gene such as β-galactosidase and expressed in cells.
Compounds that up-regulate the activity of the promoter may therefore be identified by
measuring increased β-galactosidase activity.
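To illustrate how the reporter readout described in paragraph [486] might be scored, here is a minimal Python sketch. Everything in it is hypothetical: the compound names, the β-galactosidase activity values, and the 1.5-fold hit threshold are invented for illustration, since the source does not specify an analysis procedure. The sketch normalizes each compound's mean reporter activity to the mean of vehicle-treated wells and flags compounds whose fold induction meets the threshold.

```python
import statistics


def fold_induction(compound_wells, vehicle_wells):
    """Mean reporter activity of compound wells divided by mean vehicle activity."""
    return statistics.mean(compound_wells) / statistics.mean(vehicle_wells)


def call_hits(plate, vehicle_wells, threshold=1.5):
    """Return {compound: fold induction} for compounds at or above the threshold."""
    hits = {}
    for name, wells in plate.items():
        fold = fold_induction(wells, vehicle_wells)
        if fold >= threshold:
            hits[name] = fold
    return hits


# Hypothetical beta-galactosidase readings (arbitrary units), three wells each
vehicle = [100.0, 110.0, 90.0]
plate = {
    "compound_A": [210.0, 190.0, 200.0],
    "compound_B": [105.0, 95.0, 100.0],
    "compound_C": [125.0, 115.0, 120.0],
}

hits = call_hits(plate, vehicle)
for name, fold in hits.items():
    print(f"{name}: {fold:.2f}-fold over vehicle")
```

A real screen would also need replicate statistics and controls for compounds that affect the reporter enzyme directly; the threshold here is only a placeholder.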

TNFRSF21 

[487] Probe set 214581 detects TNFRSF21 nucleic acid sequences.
Expression of TNFRSF21 transcripts was increased in obese compared to lean patients in the
gene profiling experiment.

[...]   67.98   8.98   [...]   25.13   3.84   4   2.71   0.006   TNFRSF21

"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients.

[488] Probe set 214581 detects TNFRSF21 nucleic acid sequences.
Expression of TNFRSF21 transcripts was increased in rosiglitazone (rosi) treated compared to
vehicle treated cultures of primary human adipocytes in the gene profiling experiment.

[...]   52.77   [...]   1.56   0.016   TNFRSF21

B/C indicates sample is from Basal or Clamp; "Pre-Rosi" and "Post-Rosi" indicate that the
sample was taken before or after 24 hours of rosiglitazone treatment; "Mean Expr" indicates
mean expression; "SEM" indicates standard error of mean; "n" indicates number of patient
samples; "Fold Change" indicates fold change of primary human adipocytes post-rosi in
comparison to pre-rosi samples.

[489] TNFRSF21 was also evaluated using real-time PCR. The results 
further show that TNFRSF21 is significantly over-expressed in primary cultured human 
adipocytes treated with rosi when compared to vehicle. 



Comparison                       Expression Fold Change   ttest
Post-Rosi (12)/ Pre-Rosi (12)    1.32                     0.007

[...] expression/ mean vehicle expression. Numbers in parentheses indicate the number of
primary human adipocyte samples analyzed by real-time PCR.

[490] Probe set 214581 detects TNFRSF21 nucleic acid sequences. 
Expression of TNFRSF21 transcripts was increased in adipose tissues compared to all other 
human adult tissues in the gene profiling experiment.

[...]   70.58   15.04   5   16.12   2.78   13   4.38   0.021   TNFRSF21

"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n"
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in
comparison to all other human adult tissues profiled.

[491] TNFRSF21 was also evaluated using real-time PCR. The results 
further show that TNFRSF21 is significantly over-expressed in adipose tissues when 
compared to all other human adult tissues. 



Comparison                                 Expression Fold Change   ttest
Fat Tissues (5)/ All Other Tissues (13)    2.82                     0.011

[...] tissues expression/ mean other tissues expression. Numbers in parentheses indicate the
number of human adult tissue samples analyzed by real-time PCR.

[492] TNFRSF21 was over-expressed in 3T3-L1 adipocytes and the effect on
basal and insulin-stimulated glucose transport was determined.



Glucose Transport

        Experiment 1                 Experiment 2                 Experiment 3
[...]   FC (hTNFRSF21/Con)   ttest   FC (hTNFRSF21/Con)   ttest   FC (hTNFRSF21/Con)   ttest   Mean FC +/- SEM (n=3)
[...]   0.99                 [...]   0.78                 0.066   0.71                 [...]   0.83±0.08
0.05    0.74                 0.048   0.84                 0.029   0.74                 0.107   0.77±0.03
[...]   0.73                 [...]   0.75                 0.133   0.68                 0.306   0.72±0.02
0.3     0.79                 0.062   0.88                 0.112   0.88                 0.709   0.85±0.03
1       0.84                 0.075   0.96                 0.479   0.92                 0.621   0.91±0.04
10      0.95                 0.760   0.93                 0.400   0.93                 0.839   0.94±0.01



Legend: "Con" indicates control 3T3-L1 adipocytes that do not express hTNFRSF21. "FC"
indicates the fold change, defined as the following ratio: glucose transport in
hTNFRSF21-expressing cells/glucose transport in non-TNFRSF21-expressing cells. "h" is
human; "n" is the number of experiments. SEM is the standard error of the mean.

[493] The results show that increasing the levels of TNFRSF21 in a cell such
as an adipocyte leads to a corresponding decrease in glucose uptake. This indicates that
decreasing the levels or activity of TNFRSF21 in tissues of insulin resistant patients or
diabetic patients will increase the ability of such tissues to take up glucose and, hence, will
provide an effective treatment for insulin resistance and diabetes.

[494] TNFRSF21 contains the following protein domains (designated with
reference to SEQ ID NO:236): Signal peptide at amino acids 1 to 41; Death domain
(PF00531) at amino acids 416 to 498; TNFR/NGFR cysteine-rich region (PF00020) at amino
acids 50 to 88, 91 to 131, 133 to 168, 171 to 211; and 1 transmembrane domain
(TMHMM2.0) at amino acids 350 to 369. TNFRSF21 has been shown to activate NF-kappaB
and MAPK8/JNK, and induce cell apoptosis. Through its death domain, this receptor
interacts with TRADD protein, which is known to serve as an adaptor that mediates signal
transduction of TNF-receptors. Knockout studies in mice suggested that this gene plays a role
in T-helper cell activation, and may be involved in inflammation and immune regulation (Pan
G. et al., FEBS Lett. 431(3): 351-356 (1998); Kasof G.M. et al., Oncogene 20(55): 7965-7975
(2001)).

[495] TNFRSF21 is up-regulated by TNFalpha, which itself has been
associated with states of insulin resistance (see, e.g., Hotamisligil GS. J Intern Med. 245:621-625
(1999); Kasof GM et al., Oncogene 20:7965-7975 (2001)). An exemplary assay to
identify compounds that suppress TNFRSF21 can thus employ cells such as LNCaP to screen
candidate compounds for the ability to inhibit TNFalpha-induced up-regulation of
TNFRSF21 mRNA or protein.






TNFSF13B 

[496] Probe set 223501 detects TNFSF13B nucleic acid sequences. 
Expression of TNFSF13B transcripts was increased in diabetic compared to lean patients in 
the gene profiling experiment. 





        DIABETIC                    LEAN
B/C     Mean Expr   SEM    n        Mean Expr   SEM    n     Fold Change   Student's t test   Gene Name
C       44.86       3.06   10       28.79       3.14   8     1.56          0.002              TNFSF13B



B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression;
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of diabetics in comparison to lean patients.
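The fold change and significance columns of the TNFSF13B table can be approximated from the tabulated summary statistics alone. The Python sketch below is a minimal illustration under stated assumptions: that "Fold Change" is the ratio of the group means, and that the t test is an unpaired, two-sided, equal-variance Student's t test (the source does not specify the variant). Standard deviations are recovered as SEM × √n, and the p-value is obtained by numerically integrating the t density with Simpson's rule so that only the standard library is needed. All function names are illustrative.

```python
import math


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def two_sided_p(t, df, steps=2000):
    """Two-sided p-value: 1 - 2 * integral of the density from 0 to |t|.

    The integral uses Simpson's rule; steps must be even.
    """
    t = abs(t)
    h = t / steps
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 1.0 - 2.0 * total * h / 3.0


def pooled_t_from_summary(mean1, sem1, n1, mean2, sem2, n2):
    """Equal-variance t statistic, degrees of freedom and p-value from summary stats."""
    var1 = sem1 ** 2 * n1  # SD^2 = (SEM * sqrt(n))^2
    var2 = sem2 ** 2 * n2
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / df
    se_diff = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    t = (mean1 - mean2) / se_diff
    return t, df, two_sided_p(t, df)


# Diabetic: mean 44.86, SEM 3.06, n 10; lean: mean 28.79, SEM 3.14, n 8
fold_change = 44.86 / 28.79
t_stat, df, p_value = pooled_t_from_summary(44.86, 3.06, 10, 28.79, 3.14, 8)
print(f"fold change = {fold_change:.2f}, t = {t_stat:.2f}, df = {df}, p = {p_value:.4f}")
```

The ratio of means rounds to the tabulated 1.56. How closely the computed p-value tracks the tabulated one depends on which test variant was actually used.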

[497] TNFSF13B was also evaluated using real-time PCR. The results
further show that TNFSF13B is significantly over-expressed in subcutaneous adipose from
diabetic individuals when compared to subcutaneous adipose from lean individuals.

Comparison                 Expression Fold Change   ttest
Diabetic (10) /Lean (8)    1.54                     0.007

"Fold Change" indicates the fold expression calculated as the ratio of the mean diabetic
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR.

[498] TNFSF13B contains the following protein domains (designated with
reference to SEQ ID NO:242): Signal anchor at amino acids 0 to 0; TNF (Tumour Necrosis
Factor) family (PF00229) at amino acids 166 to 284; and 1 transmembrane domain
(TMHMM2.0) at amino acids 48 to 70. A soluble active secreted form of TNFSF13B has
been detected (Schneider, P. et al., J Exp. Med. 189: 1747-1756 (1999)) and this is displayed
in SEQ ID NO:243. TNFSF13B is a ligand for multiple receptors including
TNFRSF13B/TACI, TNFRSF17/BCMA, and TNFRSF13C/BAFFR. This cytokine is
expressed in B cell lineage cells, and acts as a potent B cell activator. It has also been shown
to play an important role in the proliferation and differentiation of B cells (Patke, A., et al.,
Curr Opin Immunol 16: 251-5 (2004); Schneider, P. and Tschopp, J., Immunol Lett 88: 57-62
(2003)).



TNFSF14



[499] Probe set 207907 detects TNFSF14 nucleic acid sequences. Expression 
of TNFSF14 transcripts was increased in omental tissue compared to all other human adult 
tissues in the gene profiling experiment.

[...]

"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n"
indicates number of tissue samples; "Fold Change" indicates fold change of omental tissues
in comparison to all other human adult tissues profiled.

[500] TNFSF14 was also evaluated using real-time PCR. The results further 
show that TNFSF14 is significantly over-expressed in omental tissue when compared to all 
other human adult tissues. 



Comparison                              Expression Fold Change   ttest
Omental (1) / All Other Tissues (17)    16.75                    n.d.

"Fold Change" indicates the fold expression calculated as the ratio of the mean omental tissue
expression/ mean other tissues expression. Numbers in parentheses indicate the number of
human adult tissue samples analyzed by real-time PCR.

[501] TNFSF14 contains the following protein domains (designated with
reference to SEQ ID NO:251): Signal anchor at amino acids 0 to 0; TNF (Tumour Necrosis
Factor) family (PF00229) at amino acids 93 to 240; and 1 transmembrane domain
(TMHMM2.0) at amino acids 36 to 58. A soluble active secreted form of TNFSF14 has been
detected (Harrop, J.A., et al., J Biol Chem. 273: 27548-56 (1998)) and this is displayed in SEQ
ID NO:252. TNFSF14 is the ligand for HVEM (TNFRSF14) and is reported to induce
lymphocyte proliferation, induce apoptosis, suppress in vivo tumor formation and
facilitate herpes virus entry (Mauri D.N. et al., Immunity 8(1): 21-30 (1998); Zhai Y. et al., J
Clin Invest 102(6): 1142-51 (1998); Castellano, R. et al., J Biol Chem. 277(45): 42841-51
(2001)).



TPSB2 

[502] Probe set 205683 detects TPSB2 nucleic acid sequences. Expression of
TPSB2 transcripts was increased in obese compared to lean patients in the gene profiling 
experiment. 
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B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression; 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold
Change" indicates fold change of obese in comparison to lean patients. 

[503] TPSB2 was also evaluated using real-time PCR. The results further
show that TPSB2 is significantly over-expressed in subcutaneous adipose from obese 
individuals when compared to subcutaneous adipose from lean individuals. 
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"Fold Change" indicates the fold expression calculated as the ratio of the mean obese 
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR. 

[504] Probe set 205683 detects TPSB2 nucleic acid sequences. Expression of
TPSB2 transcripts was increased in adipose tissues compared to all other human adult tissues
in the gene profiling experiment.
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"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n" 
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in 
comparison to all other human adult tissues profiled. 

[505] TPSB2 contains the following protein domains (designated with 
reference to SEQ ID NO:262): Signal peptide at amino acids 1 to 18; and Trypsin (PF00089) 
at amino acids 31 to 267. A soluble active secreted form of TPSB2 has been detected and this 
is displayed in SEQ ID NO:263. TPSB2 is a tryptase beta 2 and belongs to the family of mast 
cell serine proteases which have been implicated as mediators in the pathogenesis of asthma 
and other allergic and inflammatory disorders. Beta tryptases appear to be the main 
isoenzymes expressed in mast cells (Pallaoro M. et al., J Biol Chem. 274(6): 3355-62 (1999)).

WISP2 

[506] Probe set 205792 detects WISP2 nucleic acid sequences. Expression of 
WISP2 transcripts was increased in obese compared to lean patients in the gene profiling 
experiment. 
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B/C indicates sample is from Basal or Clamp; "Mean Expr" indicates mean expression; 
"SEM" indicates standard error of mean; "n" indicates number of patient samples; "Fold 
Change" indicates fold change of obese in comparison to lean patients. 

[507] WISP2 was also evaluated using real-time PCR. The results further 
show that WISP2 is significantly over-expressed in subcutaneous adipose from obese 
individuals when compared to subcutaneous adipose from lean individuals. 
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"Fold Change" indicates the fold expression calculated as the ratio of the mean obese 
expression/ mean lean expression. Numbers in parentheses indicate the number of patient
samples analyzed by real-time PCR. 

[508] Probe set 205792 detects WISP2 nucleic acid sequences. Expression of 
WISP2 transcripts was increased in adipose tissues compared to all other human adult tissues 
in the gene profiling experiment. 
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"Mean Expr" indicates mean expression; "SEM" indicates standard error of mean; "n" 
indicates number of tissue samples; "Fold Change" indicates fold change of adipose tissues in 
comparison to all other human adult tissues profiled. 



[509] WISP2 contains the following protein domains (designated with
reference to SEQ ID NO:269): Signal peptide at amino acids 1 to 23; Insulin-like growth
factor binding protein (PF00219) at amino acids 26 to 96; von Willebrand factor type C
domain (PF00093) at amino acids 100 to 163; Thrombospondin type 1 domain (PF00090) at
amino acids 196 to 237; and TNFR/NGFR cysteine-rich region (PF00020) at amino acids 39
to 70. A soluble active secreted form of WISP2 has been detected (Pennica, D., et al., Proc
Natl Acad Sci USA 95: 14717-22 (1998)) and this is displayed in SEQ ID NO:270. WISP2 is
a member of the CCN family of growth factors and is found to promote the adhesion of
osteoblast cells. Decreased expression of WISP2 may be therapeutic in treatment of breast
cancer (Kumar S. et al., J Biol Chem. 274(24): 17123-31 (1999); Pennica D. et al., Proc Natl
Acad Sci USA 95(25): 14717-22 (1998)). WISP2 is also found to inhibit vascular smooth
muscle cell proliferation, motility, and invasiveness (Lake, A.C. et al., Am J Pathol 162:
219-31 (2003)).
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SEQUENCE LISTING 



SEQ ID NO:1

gi|18390318|ref|NM_015419.1| Homo sapiens adlican (DKFZp564I1922), mRNA

1 atgcccaagc gcgcgcactg gggggccctc tccgtggtgc tgatcctgct ttggggccat 
61 ccgcgagtgg cgctggcctg cccgcatcct tgtgcctgct acgtccccag cgaggtccac
121 tgcacgttcc gatccctggc ttccgtgccc gctggcattg ctagacacgt ggaaagaatc 
181 aatttggggt ttaatagcat acaggccctg tcagaaacct catttgcagg actgaccaag 
241 ttggagctac ttatgattca cggcaatgag atcccaagca tccccgatgg agctttaaga 
301 gacctcagct ctcttcaggt tttcaagttc agctacaaca agctgagagt gatcacagga 
361 cagaccctcc agggtctctc taacttaatg aggctgcaca ttgaccacaa caagatcgag
421 tttatccacc ctcaagcttt caacggctta acgtctctga ggctactcca tttggaagga 
481 aatctcctcc accagctgca ccccagcacc ttctccacgt tcacattttt ggattatttc 
541 agactctcca ccataaggca cctctactta gcagagaaca tggttagaac tcttcctgcc 
601 agcatgcttc ggaacatgcc gcttctggag aatctttact tgcagggaaa tccgtggacc 
661 tgcgattgtg agatgagatg gtttttggaa tgggatgcaa aatccagagg aattctgaag
721 tgtaaaaagg acaaagctta tgaaggcggt cagttgtgtg caatgtgctt cagtccaaag 
781 aagttgtaca aacatgagat acacaagctg aaggacatga cttgtctgaa gccttcaata 
841 gagtcccctc tgagacagaa caggagcagg agtattgagg aggagcaaga acaggaagag 
901 gatggtggca gccagctcat cctggagaaa ttccaactgc cccagtggag catctctttg 
961 aatatgaccg acgagcacgg gaacatggtg aacttggtct gtgacatcaa gaaaccaatg
1021 gatgtgtaca agattcactt gaaccaaacg gatcctccag atattgacat aaatgcaaca 
1081 gttgccttgg actttgagtg tccaatgacc cgagaaaact atgaaaagct atggaaattg 
1141 atagcatact acagtgaagt tcccgtgaag ctacacagag agctcatgct cagcaaagac 
1201 cccagagtca gctaccagta caggcaggat gctgatgagg aagctcttta ctacacaggt 
1261 gtgagagccc agattcttgc agaaccagaa tgggtcatgc agccatccat agatatccag
1321 ctgaaccgac gtcagagtac ggccaagaag gtgctacttt cctactacac ccagtattct 
1381 caaacaatat ccaccaaaga tacaaggcag gctcggggca gaagctgggt aatgattgag 
1441 cctagtggag ctgtgcaaag agatcagact gtcctggaag ggggtccatg ccagttgagc 
1501 tgcaacgtga aagcttctga gagtccatct atcttctggg tgcttccaga tggctccatc 
1561 ctgaaagcgc ccatggatga cccagacagc aagttctcca ttctcagcag tggctggctg
1621 aggatcaagt ccatggagcc atctgactca ggcttgtacc agtgcattgc tcaagtgagg 
1681 gatgaaatgg accgcatggt atatagggta cttgtgcagt ctccctccac tcagccagcc 
1741 gagaaagaca cagtgacaat tggcaagaac ccaggggagt cggtgacatt gccttgcaat 
1801 gctttagcaa tacccgaagc ccaccttagc tggattcttc caaacagaag gataattaat 
1861 gatttggcta acacatcaca tgtatacatg ttgccaaatg gaactctttc catcccaaag
1921 gtccaagtca gtgatagtgg ttactacaga tgtgtggctg tcaaccagca aggggcagac 
1981 cattttacgg tgggaatcac agtgaccaag aaagggtctg gcttgccatc caaaagaggc 
2041 agacgcccag gtgcaaaggc tctttccaga gtcagagaag acatcgtgga ggatgaaggg 
2101 ggctcgggca tgggagatga agagaacact tcaaggagac ttctgcatcc aaaggaccaa 
2161 gaggtgttcc tcaaaacaaa ggatgatgcc atcaatggag acaagaaagc caagaaaggg
2221 agaagaaagc tgaaactctg gaagcattcg gaaaaagaac cagagaccaa tgttgcagaa 
2281 ggtcgcagag tgtttgaatc tagacgaagg ataaacatgg caaacaaaca gattaatccg 
2341 gagcgctggg ctgatatttt agccaaagtc cgtgggaaaa atctccctaa gggcacagaa 
2401 gtacccccgt tgattaaaac cacaagtcct ccatccttga gcctagaagt cacaccacct 
2461 tttcctgctg tttctccccc ctcagcatct cctgtgcaga cagtaaccag tgctgaagaa
2521 tcctcagcag atgtacctct acttggtgaa gaagagcacg ttttgggtac catttcctca 
2581 gccagcatgg ggctagaaca caaccacaat ggagttattc ttgttgaacc tgaagtaaca 
2641 agcacacctc tggaggaagt tgttgatgac ctttctgaga agactgagga gataacttcc 
2701 actgaaggag acctgaaggg gacagcagcc cctacactta tatctgagcc ttatgaacca 
2761 tctcctactc tgcacacatt agacacagtc tatgaaaagc ccacccatga agagacggca
2821 acagagggr.t ggtctgcagc agatgttgga tcgtcaccag agcccacatc cagtgagtat 
2881 gagcctccat tggatgctgt ctccttggct gagtctgagc ccatgcaata ctttgaccca 
2941 gatttggaga ctaagtcaca accagatgag gataagatga aagaagacac ctttgcacac 
3001 cttactccaa cccccaccat ctgggttaat gactccagta catcacagtt atttgaggat 
3061 tctactatag gggaaccagg tgtcccaggc caatcacatc tacaaggact gacagacaac
3121 atccaccttg tgaaaagtag tctaagcact caagacacct tactgattaa aaagggtatg 
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3181 aaagagatgt ctcagacact acagggagga aatatgctag agggagaccc cacacactcc 
3241 agaagttctg agagtgaggg ccaagagagc aaatccatca ctttgcctga ctccacactg 
3301 ggtataatga gcagtatgtc tccagttaag aagcctgcgg aaaccacagt tggtaccctc 
3361 ctagacaaag acaccacaac agtaacaaca acaccaaggc aaaaagttgc tccgtcatcc 
3421 accatgagca ctcacccttc tcgaaggaga cccaacggga gaaggagatt acgccccaac
3481 aaattccgcc accggcacaa gcaaacccca cccacaactt ttgccccatc agagactttt 
3541 tctactcaac caactcaagc acctgacatt aagatttcaa gtcaagtgga gagttctctg 
3601 gttcctacag cttgggtgga taacacagtt aataccccca aacagttgga aatggagaag 
3661 aatgcagaac ccacatccaa gggaacacca cggagaaaac acgggaagag gccaaacaaa 
3721 catcgatata ccccttctac agtgagctca agagcgtccg gatccaagcc cagcccttct
3781 ccagaaaata aacatagaaa cattgttact cccagttcag aaactatact tttgcctaga 
3841 actgtttctc tgaaaactga gggcccttat gattccttag attacatgac aaccaccaga 
3901 aaaatatatt catcttaccc taaagtccaa gagacacttc cagtcacata taaacccaca 
3961 tcagatggaa aagaaattaa ggatgatgtt gccacaaatg ttgacaaaca taaaagtgac 
4021 attttagtca ctggtgaatc aattactaat gccataccaa cttctcgctc cttggtctcc
4081 actatgggag aatttaagga agaatcctct cctgtaggct ttccaggaac tccaacctgg 
4141 aatccctcaa ggacggccca gcctgggagg ctacagacag acatacctgt taccacttct 
4201 ggggaaaatc ttacagaccc tccccttctt aaagagcttg aggatgtgga tttcacttcc 
4261 gagtttttgt cctctttgac agtctccaca ccatttcacc aggaagaagc tggttcttcc 

4321 acaactctct caagcataaa agtggaggtg gcttcaagtc aggcagaaac caccaccctt
4381 gatcaagatc atcttgaaac cactgtggct attctccttt ctgaaactag accacagaat 
4441 cacaccccta ctgctgcccg gatgaaggag ccagcatcct cgtccccatc cacaattctc 
4501 atgtctttgg gacaaaccac caccactaag ccagcacttc ccagtccaag aatatctcaa 
4561 gcatctagag attccaagga aaatgttttc ttgaattatg tggggaatcc agaaacagaa 

4621 gcaaccccag tcaacaatga aggaacacag catatgtcag ggccaaatga attatcaaca
4681 ccctcttccg accgggatgc atttaacttg tctacaaagc tggaattgga aaagcaagta 
4741 tttggtagta ggagtctacc acgtggccca gatagccaac gccaggatgg aagagttcat 
4801 gcttctcatc aactaaccag agtccctgcc aaacccatcc taccaacagc aacagtgagg 
4861 ctacctgaaa tgtccacaca aagcgcttcc agatactttg taacttccca gtcacctcgt 

4921 cactggacca acaaaccgga aataactaca tatccttctg gggctttgcc agagaacaaa
4981 cagtttacaa ctccaagatt atcaagtaca acaattcctc tcccattgca catgtccaaa 
5041 cccagcattc ctagtaagtt tactgaccga agaactgacc aattcaatgg ttactccaaa 
5101 gtgtttggaa ataacaacat ccctgaggca agaaacccag ttggaaagcc tcccagtcca 
5161 agaattcctc attattccaa tggaagactc cctttcttta ccaacaagac tctttctttt 

5221 ccacagttgg gagtcacccg gagaccccag atacccactt ctcctgcccc agtaatgaga
5281 gagagaaaag ttattccagg ttcctacaac aggatacatt cccatagcac cttccatctg 
5341 gactttggcc ctccggcacc tccgttgttg cacactccgc agaccacggg atcaccctca 
5401 actaacttac agaatatccc tatggtctct tccacccaga gttctatctc ctttataaca
5461 tcttctgtcc agtcctcagg aagcttccac cagagcagct caaagttctt tgcaggagga 

5521 cctcctgcat ccaaattctg gtctcttggg gaaaagcccc aaatcctcac caagtcccca
5581 cagactgtgt ccgtcaccgc tgagacagac actgtgttcc cctgtgaggc aacaggaaaa 
5641 ccaaagcctt tcgttacttg gacaaaggtt tccacaggag ctcttatgac tccgaatacc 
5701 aggatacaac ggtttgaggt tctcaagaac ggtaccttag tgatacggaa ggttcaagta 
5761 caagatcgag gccagtatat gtgcaccgcc agcaacctgc acggcctgga caggatggtg 

5821 gtcttgcttt cggtcaccgt gcagcaacct caaatcctag cctcccacta ccaggacgtc
5881 actgtctacc tgggagacac cattgcaatg gagtgtctgg ccaaagggac cccagccccc 
5941 caaatttcct ggatcttccc tgacaggagg gtgtggcaaa ctgtgtcccc cgtggagagc 
6001 cgcatcaccc tgcacgaaaa ccggaccctt tccatcaagg aggcgtcctt ctcagacaga 
6061 ggcgtccata agtgcgtggc cagcaatgca gccggggcgg acagcctggc catccgcctg 

6121 cacgtggcgg cactgccccc cgttatccac caggagaagc tggagaacat ctcgctgccc
6181 ccggggctca gcattcacat tcactgcact gccaaggctg cgcccctgcc cagcgtgcgc 
6241 tgggtgctcg gggacggtac ccagatccgc ccctcgcagt tcctccacgg gaacttgttt 
6301 gttttcccca acgggacgct ctacatccgc aacctcgcgc ccaaggacag cgggcgctat 
6361 gagtgcgtgg ccgccaacct ggtaggctcc gcgcgcagga cggtgcagct gaacgtgcag 

6421 cgtgcagcag ccaacgcgcg catcacgggc acctccccgc ggaggacgga cgtcaggtac
6481 ggaggaaccc tcaagctgga ctgcagcgcc tcgggggacc cctggccgcg catcctctgg
6541 aggctgccgt ccaagaggat gatcgacgcg ctcttcagtt ttgatagcag aatcaaggtg 
6601 tttgccaatg ggaccctggt ggtgaaatca gtgacggaca aagatgccgg agattacctg 
6661 tgcgtagctc gaaataaggt tggtgatgac tacgtggtgc tcaaagtgga tgtggtgatg 

6721 aaaccggcca agattgaaca caaggaggag aacgaccaca aagtcttcta cgggggtgac
6781 ctgaaagtgg actgtgtggc caccgggctt cccaatcccg agatctcctg gagcctccca 
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6841 gacgggagtc tggtgaactc cttcatgcag tcggatgaca gcggtggacg caccaagcgc 
6901 tatgtcgtct tcaacaatgg gacactctac tttaacgaag tggggatgag ggaggaagga 
6961 gactacacct gctttgctga aaatcaggtc gggaaggacg agatgagagt cagagtcaag 
7021 gtggtgacag cgcccgccac catccggaac aagacttact tggcggttca ggtgccctat 
7081 ggagacgtgg tcactgtagc ctgtgaggcc aaaggagaac ccatgcccaa ggtgacttgg 
7141 ttgtccccaa ccaacaaggt gatccccacc tcctctgaga agtatcagat ataccaagat 
7201 ggcactctcc ttattcagaa agcccagcgt tctgacagcg gcaactacac ctgcctggtc 
7261 aggaacagcg cgggagagga taggaagacg gtgtggattc acgtcaacgt ccagccaccc 
7321 aagatcaacg gtaaccccaa ccccatcacc accgtgcggg agatagcagc cgggggcagt 
7381 cggaaactga ttgactgcaa agctgaaggc atccccaccc cgagggtgtt atgggctttt 
7441 cccgagggtg tggttctgcc agctccatac tatggaaacc ggatcactgt ccatggcaac 
7501 ggttccctgg acatcaggag tttgaggaag agcgactccg tccagctggt atgcatggca 
7561 cgcaacgagg gaggggaggc gaggttgatc gtgcagctca ctgtcctgga gcccatggag 
7621 aaacccatct tccacgaccc gatcagcgag aagatcacgg ccatggcggg ccacaccatc 
7681 agcctcaact gctctgccgc ggggaccccg acacccagcc tggtgtgggt ccttcccaat
7741 ggcaccgatc tgcagagtgg acagcagctg cagcgcttct accacaaggc tgacggcatg 
7801 ctacacatta gcggtctctc ctcggtggac gctggggcct accgctgcgt ggcccgcaat 
7861 gccgctggcc acacggagag gctggtctcc ctgaaggtgg gactgaagcc agaagcaaac 
7921 aagcagtatc ataacctggt cagcatcatc aatggtgaga ccctgaagct cccctgcacc 
7981 cctcccgggg ctgggcaggg acgtttctcc tggacgctcc ccaatggcat gcatctggag
8041 ggcccccaaa ccctgggacg cgtttctctt ctggacaatg gcaccctcac ggttcgtgag 
8101 gcctcggtgt ttgacagggg tacctatgta tgcaggatgg agacggagta cggcccttcg 
8161 gtcaccagca tccccgtgat tgtgatcgcc tatcctcccc ggatcaccag cgagcccacc
8221 ccggtcatct acacccggcc cgggaacacc gtgaaactga actgcatggc tatggggatt 
8281 cccaaagctg acatcacgtg ggagttaccg gataagtcgc atctgaaggc aggggttcag 
8341 gctcgtctgt atggaaacag atttcttcac ccccagggat cactgaccat ccagcatgcc 
8401 acacagagag atgccggctt ctacaagtgc atggcaaaaa acattctcgg cagtgactcc 
8461 aaaacaactt acatccacgt cttctgaaat gtggattcca gaatgattgc ttaggaactg 
8521 acaacaaagc ggggtttgta agggaagcca ggttggggaa taggagctct taaataatgt 
8581 gtcacagtgc atggtggcct ctggtgggtt tcaagttgag gttgatcttg atctacaatt
8641 gttgggaaaa ggaagcaatg cagacacgag aaggagggct cagccttgct gagacacttt 
8701 cttttgtgtt tacatcatgc caggggcttc attcagggtg tctgtgctct gactgcaatt 
8761 tttcttcttt tgcaaatgcc actcgactgc cttcataagc gtccatagga tatctgagga 
8821 acattcatca aaaataagcc atagacatga acaacacctc actaccccat tgaagacgca 
8881 tcacctagtt aacctgctgc agtttttaca tgatagactt tgttccagat tgacaagtca
8941 tctttcagtt atttcctctg tcacttcaaa actccagctt gcccaataag gatttagaac 
9001 cagagtgact gatatatata tatatatttt aattcagagt tacatacata cagctaccat 
9061 tttatatgaa aaaagaaaaa catttcttcc tggaactcac tttttatata atgttttata 
9121 tatatatttt ttcctttcaa atcagacgat gagactagaa ggagaaatac tttctgtctt 
9181 attaaaatta ataaattatt ggtctttaca agacttggat acattacagc agacatggaa
9241 atataatttt aaaaaatttc tctccaacct ccttcaaatt cagtcaccac tgttatatta 
9301 ccttctccag gaaccctcca gtggggaagg ctgcgatatt agatttcctt gtatgcaaag 
9361 tttttgttga aagctgtgct cagaggaggt gagaggagag gaaggagaaa actgcatcat 
9421 aactttacag aattgaatct agagtcttcc ccgaaaagcc cagaaacttc tctgcagtat 
9481 ctggcttgtc catctggtct aaggtggctg cttcttcccc agccatgagt cagtttgtgc
9541 ccatgaataa tacacgacct gttatttcca tgactgcttt actgtatttt taaggtcaat 
9601 atactgtaca tttgataata aaataatatt ctcccaaaaa aaaaa

SEQ ID NO:2

Amino acid sequence of human ADLICAN encoded by the DNA sequence shown in SEQ ID
NO:1.

MPKRAHWGALSVVLILLWGHPRVALACPHPCACYVPSEVHCTFRSLASVPAGIARHVERI
NLGFNSIQALSETSFAGLTKLELLMIHGNEIPSIPDGALRDLSSLQVFKFSYNKLRVITG
QTLQGLSNLMRLHIDHNKIEFIHPQAFNGLTSLRLLHLEGNLLHQLHPSTFSTFTFLDYF
RLSTIRHLYLAENMVRTLPASMLRNMPLLENLYLQGNPWTCDCEMRWFLEWDAKSRGILK
CKKDKAYEGGQLCAMCFSPKKLYKHEIHKLKDMTCLKPSIESPLRQNRSRSIEEEQEQEE
DGGSQLILEKFQLPQWSISLNMTDEHGNMVNLVCDIKKPMDVYKIHLNQTDPPDIDINAT
VALDFECPMTRENYEKLWKLIAYYSEVPVKLHRELMLSKDPRVSYQYRQDADEEALYYTG
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\n^QlLAEPEWVMQPSIDIQLNRRQSTAKK\n^I^YYTQYSQTISTKDTRQA^ 
PSGAVQRDQTVLEGGPCQLSCNVKAS ESPS I FWVLPDGS I LKAPMDDPDS KFS I LSSGWL 
R I KS MEPSDSGL YQC I AQVRDEMDRMVYR VL VQS PSTQPABKDTVT IGKNPG ES VTLPCN 
ALAI PEAHLSWI LPNRRI INDLANTSHVYMLPNGTLS IPKVQVSDSGYYRCVAVNQQGAD 
5 HFTVGITVTKKGSGLPSKRGRRPGAKALSRVREDIVEDEGGSGMGDEENTSRJUiLHPKDQ 
EVFLKTKDDAINGDKICAKKGRRKLKLWKHSEKEP^^ 

ERWADIIAKVRGKNLPKGTEVPPLIKTTSPPSLSLEVTPPFPAVSPPSASPVQTVTSAEE 
SSADVPLLGEEEHVLGTISSASMGLEHHHNGVILVEPEVTSTPLEEVVDDI^EKTEEITS 
TEGDLKGTAAPTLISEPYEPSPTLHTIiDTVYEKPTHEETATEGWSAADVGSSPEPTSSEY 

1 0 EPPLDAVSLAESEPMQYFDPDLETKSQPDEDKMKEOTFAHLTPTPTIWVNDSSTSQLFED 
STIGEPGVPGQSHLQGI/IOTIHLVKSSLSTQDTLLIKKGMKEMSOT 
RSSESEGQESKSITLPDSTLGIMSSMSPVKKPAETTVGTLLDKDTTTVTTTPRQKVAPSS 
TMSTHPSRRRPNGRRRLRPNKFRHRHKQTPPTTFAPSETFSTQPTQAPDIKISSQVESSL 
VPTAWVDNTVOTPKQLEMEKNAEPTSKGTPRRKHGKRPNKHRYTPSWSSRASGSKPSPS 

1 5 PENIGlRNI\n , PSSETILLPRTVSLKTEGPYDSIjDYMTTTRKIYSSYPKVQETLPVTYKPT 
SDGKE I KDDVATNVDKHKSDILVTGES ITNAI PTSRSLVSTMGEFKEES S PVGFPGTPTW 
NPSRTAQPGRLQTDIPVTTSGENLTDPPLLKELEDVDFTSEFLSSLTVSTPFHQEEAGSS 
TTLSSIKVEVASSQAETTTLDQDHLETTVAILLSETRPQNHTPTAARMKEPASSSPSTIL 
MSLGQTTTTKPALPSPRISQASRDSKENVFLNYVGNPETEATPVNNEGTQHMSGPNELST 

20 PS S DRDAFNLS TKLELE KQVFG SRS L PRGPDSQRQDGRVHASHQLTR VP AKP I LPTATVR 
LPEMSTQSASRYFVTSQSPRHWTNKPEITTYPSGALPENKQFTTPRLSSTTIPLPLHMSK 
PS I PS KFTDRRTDQFNGYSKVFGNNNI PEARNPVGKPPS PRIPH YSNGRLPFFTNKTLS F 
PQLG VTRRPOI PTS PAPVMRERKVI PGS YNR I HSHS TFHLDFGPPAPPLLHTPQTTGS PS 
TNLQNIPMVSSTQSSISFITSSVQSSGSFHQSSSKFFAGGPPASKFWSLGEKPQILTKSP 

25 QTVSVTAETDTVFPCEATOKPKPF^WTKVSTGALMTPNTRIQRFEVLKNGTLVIRKVQ^ 
QDRGQYMCTASNLHGIJDRMVVLLSVTVQQPQI 

Q I S W I FPDRRVWQTVS P VES R I TLKENRTLS I KE AS FSDRGVYKCVASN AAGADSLAI RL 
HVAALPPVIHQEKLENISLPPGLSIHIHCTAKAAPLPSVRWVU3DGTQIRPSQFLHGNLF 
VFPNGTLY IRNLAPKDSGRYE CVAANLVGS ARRTVQLNVQRAAANARI TGTS PRRTDVRY 

30 GGTLKLDCSASGDPWPRILWRLPSKRMIDAIiFSFDSRIKVFANGTIjVVKSVTDKDAGDYL 
CVARNKVGDDYWLKOTVVMKPAKIEHKEENDHKVFYGGDLKVDCVATGIjPNPEISWSLP 
DGSLVNSFMQSDDSGGRTKRYWFNNGTLYFNEVGMREEGDYTCFAENQVGKDEMRVRVK 
WTAPATI RNKTYLAVQVPYGDWTVACEAKGEPMPKVTWLS PTNKVI PTSSEKYQI YQD 
GTLLIQKAQRSDSGNYTCLVRNSAGEDRKTVWIHVNVQPPKINGNPNPITTVREIAAGGS 

35 RKL I DCKAEGI PTPRVLWAFPEGWLPAP YYGNR I TVHGNGS LD I RS LRKSDS VQL VCMA 
RNEGGEARLIVQLTVTLEPMEKPIFHDPISEKITAMAGHTISLNCSAAGTPTPSLVWVLPN 
GTDLQSGQQLQRFYHKADGMLHISGLSSVDAGAYRCVARNAAGHTERLVSLKVGLKPEAN 
KQYHNLVS I INGETLKLPCTPPGAGQGRFSWTLPNGMHLEGPQTIiGRVSLLDNGTLTVRE 
ASVFDRGTYVCRMETBYGPSVTSIPVIVIAYPPRITSEPTPVIYTRPGNTVKLNCMAMGI 

PKADITWELPDKSHLKAGVQARLYGNRFLHPQGSLTIQHATQRDAGFYKCMAKNILGSDS
KTTYIHVF 

SEQ ID NO: 3

gi|38076521|ref|XM_143254.3| Mus musculus RIKEN cDNA 6530405F15 gene
(6530405F15Rik), mRNA

1 atgtgggagc tggggttcag ggcgagacag agggtgggat gggcagaagg gtccaggaaa

61 aggaaagtac tggaggggag ttgggacaaa agcagcgacc aagggaacat cgcttcagtg 
121 actgaagcca ggcaaaagga gcgggaagga ttatatgtag cctgggacgc tttcataaac 
181 actgatgacg tgtttgtgca aagcaagcaa tttgaggaga aacgcctggg acgtcggaaa 
241 gaaggaaagc gcctccagtt ctctgaagag tcagtccccc agctagtgaa gactaagcct 

301 actaagcctt ttgctcccgt tggaagcaaa gaacgttcct tcaatcaggt gaaggctctc
361 ctcagaagat ttcctgtctc tgcttatgtt acaagaggat tcaaaagcaa gacagaagag 
421 ctcaggatgc agaagagagg cagggaagtc agctgcttgc tgatctccct cactgccatc 
481 tgcctggtgg tcacccctgg gagcagggtc tgtcctcgcc gatgtgcctg ctatgtgccc
541 acagaggtgc actgtacatt tcggtacctg acctccatcc cagacggcat cccagccaat 

601 gtggaacgag tcaatttagg gtataacagc ctcactagat tgacagaaaa tgacttttct
661 ggcctgagca gactggagtt actcatgctg cacagcaatg gcattcacag agtcagtgac 
721 aagaccttct cgggcttgca gtccttgcag gtcttaaaaa tgagctataa caaagtccaa 
781 ataattgaga aggatacttt gtatggactc aggagcttga cccggttgca cctggatcac
841 aacaacattg agtttatcaa ccccgaggcg ttttacggac tcaccttgct ccgcttggta 
901 catctagaag gaaaccggct gacaaagctc catccagaca catttgtctc tttgagctat 
961 ctccagatat ttaaaacctc cttcattaag tacctgtact tgtctgataa cttcctgacc 
1021 tccctcccaa aagaaatggt ctcctctatg ccaaacctag aaagccttta cttgcatgga 
1081 aacccatgga cctgtgactg ccatttaaag tggttgtccg agtggatgca gggaaaccca 
1141 gatataataa aatgcaagaa ggaaagaatc ccctccagtc ctcagcagtg tcccctttgc 
1201 atgaacccca ggatctctaa aggcagatct attgctatgg ctccatctgg ctcgttcctg 
1261 tgtacaaagc caaccattga tccaccactg aagtcaaaaa gcctgggtat tcaggaggac 
1321 aatggatctg cctccgtctc acctcaagat ttcatagaac cttttggctc cttgtctttg 
1381 aacatgacag acctgtctgg aaataaggcc aatgtgatct gtagtatcca aaagccttct 
1441 aggacattac caattgcatt cactgaagaa aatgactaca tcatgctaaa tatgtcattt 
1501 tcaacaaatc ttgtgtgcag tgtcaattat aatcacatcc agccagtgtg gcaactcctg 
1561 gctttgtaca gtgactctcc tctgatatta gaaaggaagc cccagcatac tgagactcca 
1621 ctgctgtctc ccaaatatca acaggtggct cttaggcctg aagacacttt taccaacata 
1681 gaggctgatt tcaaagcaga tcctttttgg ttccaacaag aaaaaatttc cctgcagctc 
1741 aacagaactg ctaccacact tagcacatta cagatccaat tttccacgga tgctcaaatc 
1801 actttaccaa aggcagagat gagaccggtg aaacgcaaat ggaccatgat tctgatgatg 
1861 aacaatacca gactggaaca tactgttttg gttggcggca ctattgccct ggactgtccg 
1921 ggcaaaggtg acccttcacc tcacttggaa tgggttttag ctgatgggag taaagtgaga 
1981 gccccttatg ttagtgagga tgggcgaatc ctaatagaca aaaaggggaa gttggaactc 
2041 cagatggctg atacctttga tgcaggtctt taccattgca taagcaccaa tgatgctgat 
2101 gcagatattc tcacatacag gataactgtg gtagagccct atgtagaaaa caagcatgaa 
2161 aatggagctc tgcacacagt aattatgggt gagatactcg atcttccatg cctttccact 
2221 ggtattccag atgcttctat tagctggatt cttccccgga acactgtgtt ctctcagtca 
2281 tcaagagaca tgcaaattct taacaatggg accttaagaa tattacaggc tacaccaaaa 
2341 gatcaaggcc attaccgatg tgtagcagcc aacccatcag gggctgattt ttccagtttt 
2401 caagtttcag tccaaatgaa aggtcaaagg acaattgagc atgacaggga catagatgga
2461 tctggacttg aagaacccaa gcccagtgtt ctccttaagc agccaccatc tttgaaactc 
2521 cctgcatcat ctttgacagg gacagaggct ggaaaacaag tctctgggat acataagaaa 
2581 aacaaacata gagacttaac acatcgacgg cgtggggatt ccactctccg gagattcagg 
2641 gaacacagga ggcagctccc tctctctgct cggagaattg acccccaaca ctgggcagca 
2701 cttttagaaa aagcaaagaa gaattctgtt ctaaggaagc aagaaaatac cacagtaaag 
2761 ccaacgccac tggctattcc acttgtggaa ctcgctgggg aggaaaaaga cgcctccggc 
2821 ctgactcctc cagatgaaga attcacggtt ctgaaaacta aggcttttgg tgtcccagaa 
2881 aggtcaccaa ctgctgactc tagaccagta aatcatggct ttgtgacaag ttcagcttct 
2941 ggcacagaag tctcctccac cgtgaatccg caaacactac tacccacgca ccttcctgat 
3001 ttcaaattat ttaatgttgt ggacagtgca gctgtgtcaa agagtatgaa ccgacctgta 
3061 acaagcaaga tagaagatac aacacatcaa aacccaatca ttatctttcc atcagtagct 
3121 gaaattcaag attctgctca ggtggggaga acgtcttccc aaagtgcaca ccccgcaaca 
3181 gggggagcca tggctaccta tggctatacc accatgctta gtagcttcac caacaaagcc 
3241 aatacagtcc tgcagtcagc aaatccaaca gaaagttatg gacctcagat acctttaaca 
3301 gaagtaagta gagttagcag taataactcc ttggctcaca ctactaaaga tccaggcttc 
3361 tccaagcgcc cttcagattc ccacaccact gccccttctt tatttcaaac tcctagaaac 
3421 aacagtacag gtaacgtggg aagagagagg acaatttgga gcagagggcg agctataagt 
3481 ccatatagaa ctccagttct ccgtcggcat agacacagga ttgtgaggcc agcactcaag 
3541 ggacctgcta acagaaatat aagtcaagtt tcagccacgg agccccctgg gatgtgccga 
3601 acctgttctt ccacagaaag gctcaccatg gctacggcag cactgtcagt tacaggctca 
3661 tcccacacta ccctccccaa agctaacaat gttgggatta tttcagaaga gtctaccact 
3721 gtggtcaaga agccatcttt actattgaag aacaaacaag atgtagatat agagacaata 
3781 acaaccacta taaactattt cagaagtgaa agtacccaca tgactcccac tgaagcaagc 
3841 atgatttctg ctccaacatc catatccctg ggaaaaactc ctatagacac tagtggtcac 
3901 ctgagcatgc ctaggaccat ccaagctgga acagatttag tggtgacacc accactttcc 
3961 agtccactta gccaaccctc aataccaaca aaagcaacaa gcacaaaact ctcaagaaga 
4021 aaaattccct ggcacccaat ctttgcaaat aaccataaca aagaggggat gctaaagaat 
4081 ctgcatcagt ttggtttaca aaagaacaca gccactaagc ctcctgaaaa agctcctctt
4141 ttacccacag atcatggttc ctcctcacct tctacaacac tcttggcaag tctgacgcca 
4201 gctcagtctg caacaatggc tgccactcgg cgcaatggca ctgaagtgca aggtgccaga 
4261 agtctctctg cagggaaaga gcagcccttc atcaactcct tcctagtgct tcctagcacc 
4321 acaagaaaga gatctagtac attaagcttc ctgtcagtgg aaacccccac agtgacaact 
4381 cctcctgtta ttgcatctgc cattatctct gaaacccaag aagtacgatc caaaaaagca 
4441 aaagaccaaa caaaggggtc tctgaagaac aggaaaggcc caaccatcac ccccaggcag
4501 atttctggct atagcacata ctcagttcca acaacaactg atactccctt ggctttcagt 
4561 cattccccgg gaaaagtcac tggtaggact gtaagtacag ctgctcctca ctcagcagct 
4621 tctctcctgg gcataactga actgccccag aaatgcactc atacttcggg aaatataaca 
4681 gcttcggaaa caactctgtt gagcaaatca caggagagca ccgcaatgaa aagagcctcc
4741 gccacaccac cactcctcag cagtggggca ccccgaatgc ctactccttc ccctcctccc 
4801 ttcactaagg ttgtggttac agacagtgag gtcccagcag ttttcaagat gatgtcaaat 
4861 aggatggtca ccatatatga atcttcaagg cacgatatag atctgcagca accctcagca
4921 gaggctagcc ccaatcctga gatcttaact ggatccactg actttcccct ctctagtctg 

4981 ttgacctcca ctcctatgcc agcaccaaga gtggataaac cacaggattc tcaatggaag
5041 ccttctccct ggccagaaaa caaatttcag ctcaggtcat actcagaaac cattgaaaag 
5101 ggcaaaaggc cagaaataag cctgtcaccc cacctcagct ttccagaggc cagcactcat 
5161 gccttgcatt ggaatgcaca gaggcatgca gaaaagagtg tctttgataa gaaacctgct 
5221 caaaacccaa cttccaaaca cctgccctat gactctctgc ctaagactat attgaagaaa 

5281 ccaagaataa ttggaggaaa ggctgcaagc tttactgttc caactaattc agatgtcctt
5341 cttccttgcg aggctgttgg agacccaaag cccaccatcc actggaccag agtctcatca 
5401 ggacgtgaaa tatcccgagg gatacagaaa acccggtttc atgtgcttcc caatggcacc 
5461 ttgtccatcc agagggtcag cattcaggac cgtggacagt acctgtgcgc tgcctctaat 
5521 ccactaggtg tagaccacct tcatgtcact ctgtctgtgg tctcctaccc tgctaggatt 

5581 ctggagagtc atgtcaagga gatcacagct cattccggaa gtactgtgaa acttaagtgt
5641 agagtagaag gtatgccaag acctacaatt tcctggatac tcgcaaacca aacagtggtc 
5701 tcggaaacac ccgagggaag ccggaaggtc tgggtgacac ccgatggaac gttgatcatc 
5761 cataatttga gtctttatga ccgtggtttt tacaagtgtg tagccaacaa cccatctggc 
5821 caggattcac tgttggttaa gatacaagtc atcacagctc cccctgttat tatagagcaa 

5881 aagagacaag ccatcgttgg cgttttaggt gaaagtttga aactgccctg cactgcaaag
5941 ggaactcccc agcccagtgt tcattgggtc ctctatgatg ggactgaact aaaaccactg 
6001 cagttgactc attccaggtt tttcttgtat ccaaatggga ccctgtatat aagaaacatc 
6061 gtttcttcag tcaggggcac ttatgaatgc attgctacca gctcctcggg ctcagagaga 
6121 agggtagtga ttctaagagt agaagagcaa gagacagttc ccaggataga aactgcctcg 

6181 cagaaatgga cagaggtgaa tttgggtgag aaattactac tgaactgctc agctactggg
6241 gatccgaaac ctacaataat ctggaagtta ccatccaagg ttgtcattga ccagtggcac 
6301 agaatgggca gccggatcca tgtctaccca aatggctcct tggttattgg atcggtgaca 
6361 gaaaaggatg gtggtgacta cttatgtgtg gcaagaaaca aaatgggaga tgacctggtc 
6421 ctgatgcatg tccgcctaag actgacacct gccaaaattg aacacaagca gcattttaag 

6481 aagcaagtac tccatgggaa agatttccaa gttgactgca aggcttctgg ctcccctgtg
6541 cctgaggtat cctggagttt gcctgatgga acagtggtga acaatgtagc acaagctgat 
6601 gacagtggct acaggaccaa gaggtacacc ctcttccaca acggaacctt gtatttcaac 
6661 aaagttggga tggcagagga aggagattat atttgctctg ctcagaacac cttagggaaa 
6721 gatgaaatga aagttcacct aacagttcta acagccatcc cacggataag gcaaaactac 

6781 aggagcaatg taaggatcaa ggctggagac acagctgtcc tggactgtga ggtcactggg
6841 gaacccaagc caaatgtatt ttggttgctg ccttccaaca atgtcatttc attctccaat 
6901 gacaggttca tatttcatgc caatggaact ttgtccatca ataaagtgaa accgcttgac 
6961 tctgggaagt atgtgtgtgt agctcaaaat cctagtgggg atgacactaa gacatacaaa 
7021 ctggatattg tctctaggcc tccattaatc aatggtttgt atgcaaacaa aactgttatt 

7081 aaagccacag ccattcagca ctccaaaaaa cacttggact gcagagcaga tggggtccca
7141 ccaccccaga tcacatggat tatgccagac aatattttcc tcacggctcc atactacgga 
7201 ggcagaatca ccgtgcatca aaatggaacc ttggaaattc ggaacataag gctttctgat 
7261 tctgcggatt tcacctgtgt ggttcggagc gaaggaggag agagtgtgtt ggtggtgcag 
7321 ttaaaagtac tggaaatgct gagaagacca acattcagaa acccattcaa tgaaaaagta 

7381 gttgcccagg ttggcaagcc tgtagcaatg aactgctctg tggatgggaa cccaacacct
7441 gaaattatct ggatcttacc tgatggcaca caatttgcta atgggccaca aaattcccca 
7501 tatctgatgg caagcaatgg ttctctcatt gtttacaaag caactcggaa caagtcagga 
7561 aagtatcgct gtacagcaag aaataaggtt ggctacatcg agaaactcat cctgttagaa
7621 attggacaga agccagtcat tctgacatac gaaccgggga tgataaagag cgccggtggg 

7681 gaatcgctat ccctgcattg tgtgtctgat ggaatcccta agccaaatgt caaatggact
7741 acgccaggtg gccttgtaat cgacagacct caagtcggtg gaaaatacat attgcatgaa
7801 aatggcacgt tggtcatcaa agaaacaaca gctcatgaca gaggaaatta tatctgtaag 
7861 gctcaaaaca gcgttggcca ggcagttatt agtgttcccg tgacgattgt ggcctaccct 
7921 ccccgaatta taaactacct tcccaggagc atgctcagga ggacaggaga ggcaatgcag 

7981 ctccactgtg tggccttggg agtccccaag ccacaaatca cctgggagac gccgggatac
8041 tccctgctct caacggcgac agaaagaagg ccccacagaa gtgagatgct tcccctacaa 
8101 ggtacgctgg tcattcagaa tctacgagcc tcagattctg gcgtctataa atgcagagca
8161 cagaacgtgc ttggtgccga ttatgcaaca acttacatcc aggtcctctg acaggaaggg 
8221 agaaactgaa tggaacaaaa gccaacatct gcagacttta ttttttggaa gaagtttaat 
8281 caaaggcagc cataggcatg taaatgaatc cgaatacatt tacagtatta aatttacaat 
8341 ggacatgcag tgagacttgt aaataaatgc actgtgaact gattccgagt ttccatggat 
8401 ttcaaagcaa actcttaact taaggcactt tgattttgcc aacaaataat aaaaaacatt 
8461 aagagaaaat gatccgctac aaattaacaa atggcgaatg cacctgaatt ttcagtaaaa 
8521 agacctttct tccgctaaca gttgccagct gccttgtgtc tgttttccac caatgttaca 
8581 aacatcgcac acagaatgaa tggagacaac gtgaaagatt aggtttgcag cccgtgttaa 
8641 tctcaatgca caaatatttt gtcactggtt tacaaacatt ttgataaaac ctacagaaaa 
8701 caagcgcaga actgttctga ttatgattaa tagtttacca tttgttcaca ct 

SEQ ID NO: 4 

Amino acid sequence of mouse ADLICAN encoded by the DNA sequence shown in SEQ ID NO: 3.

MWELGFRARQRVGWAEGSRKRKVLEGSWDKSSDQGNIASVTEARQKEREGLYVAWDAFIN
TDDVFVQSKQFEEKRLGRRKEGKRLQFSEESVPQLVKTKPTKPFAPVGSKERSFNQVKAL
LRRFPVSAYVTRGFKSKTEELRMQKRGREVSCLLISLTAICLVVTPGSRVCPRRCACYVP
TEVHCTFRYLTSIPDGIPANVERVNLGYNSLTRLTENDFSGLSRLELLMLHSNGIHRVSD 

KTFSGLQSLQVLKMSYNKVQIIEKDTLYGLRSLTRLHLDHNNIEFINPEAFYGLTLLRLV
HLEGNRLTKLHPDTFVSLSYLQIFKTSFIKYLYLSDNFLTSLPKEMVSSMPNLESLYLHG 
NPWTCDCHLKWLSEWMQGNPDIIKCKKERIPSSPQQCPLCMNPRISKGRSIAMVPSGSFL 
CTKPTIDPSLKSKSUSIQEDNGSASVSPQDFlEPFGSLSliNMTDLSGNKANVICSIQKPS 
RTLPIAFTEENDYIMLNMSFSTNLVCSVNYNHIQPWQLLiALYSDSPLILERKPQHTETP 

LLSPKYMVALRPEDTFTNIEADFKADPFWFQQEKISLQLNRTATTLSTLQIQFSTDAQI
TLPKAEMRPVKRKWTMILMMNNTRLEHTVLVGGTIALDCPGKGDPSPHLEWVI*AIX3SK\^ 
APYVSEDGRILIDKKGKLELQMADTFDAGLYHCISTNDADADILTYRITVVEPYVENKHE 
NGALHTVIMGEILDLPCLSTGIPDASISWILPRNTVFSQSSRDMQILNNGTLRILQATPK 
DQGHYRCVAANPSGADFSSFQVSVQMKGQRTIEHDRDIDGSGLEEPKPSVLLKQPPSLKL 

PASSLTGTEAGKQVSGIHKKNKHRDLTHRRRGDSTLRRFREHRRQLPLSARRIDPQHWAA
LLEKAKKNS\OiRKQENTTVKPTPIAIPLVELAGEEKDASGLTPPDEEFTVLKTKAFGVPE 
RSPTADSRPVNHGFVTSSASGTEVSSTVNPQTLLPTHLPDFKLFNVWSAAVSKSMNRPV 
TSKIEDTTHQNPIIIFPSVAEIOPSAQVGRTSSQSAHPATGGAMATYGYTTMLSSFTNKA 
NTVLQSANPTESYGPQIPLTEVSRVSSNNSLAHTTKDPGFSKRPSDSHTTAPSLFQTPRJM 

NSTGNVGRERTIWSRGRAISPYRTPVLRRHRHRIVRPALKGPANRNISQVSATEPPGMCR
TCSSTERLTMATAALS\rTGSSHTTLPKAKnSTVGIISEESTTWKKPSLLLKN 
TTTINYFRSESTHMTPTEASMI S APTS I SLGKTP IDTSGHLSMPRTI QAGTDLWTPPLS 
S PLSQPS I PTKATS TKLSRRKI PWHP I FANNHNKEGMliKNLHQFGLQKNTATKPPBKAPL 
LPTDHGSSSPSTTLLASLTPAQSATMAATRRNGTEVQGARSLSAGKEQPFINSFLVLPST 

TRKRSSTLSFLSVETPTVTTPPVIASAIISETQEVRSKKAKDQTKGSLKNRKGPTITPRQ
ISGYSTYSVPITTDTPIiAFSHSPGK\rrGRTVSTAAPHSAASLIiGITELPQKCTHTSGN 
ASETTLLSKSQESTAMKRASATPPLLSSGAPRMPTPSPPPFTKVVVTDSEVPAVFKMMSN 
RMVTIYESSRHDIDLQQPSAEASPNPEILTGSTDFPLSSLLTSTPMPAPRVDKPQDSQWK 
PSPWPENKFQLRSYSETIEKGKRPEISLSPHLSFPEASTHAIiHWNAQRHAEKSVFDKKPA 

QNPTSKHLPYDSLPKTILKKPRIIGGKAASFTVPTNSDVLLPCEAVGDPKPTIHWTRVSS
GREISRGIQKTRFHVLPNGTLSIQRVSIQDRGQYLCAASNPLGVDHLHVTLSVVSYPARI 
LESHVKEITAHSGSTVKLKCRVEGMPRPTISWILANQTWSETPEGSRKVWVTPDGTLII 
HNLSLYDRGFYKCVANNPSGQDSLLVKIQVITAPPVIIEQKRQAIVGVLGESLKLPCTAK 
GTPQPSVHWVLYDGTELKPLQLTHSRFFLYPNGTLYIRN1VSSVRGTYECIATSSSGSER 

RVVILRVEEQETVPRIETASQKWTEVNLGEKLLLNCSATGDPKPTIIWKLPSKVVIDQWH
RMGSRIHVYPNGSLVIGSVTEKDGGDYLCVARNKMGDDLVLMHVRLRLTPAKIEHKQHFK 
KQVLHGKDFQVDCKASGSPVPEVSWSLPDGTVVNNVAQADDSGYRTKRYTLFHNGTLYFN 
KVGMAEEGDYI CSAQNTLGKDEMKVHLTVLTAI PRIRQN YRSNVRI KAGDTAVLDCEVTG 
EPKPNVFWLLPSNNVISFSNDRFIFHANGTLSINKVKPLDSGKYVCVAQNPSGDDTKTYK 

LDIVSRPPLINGLYANKTVIKATAIQHSKKHLDCRADGVPPPQITWIMPDNIFLTAPYYG
GRITVHQNGTLEIRNIRLSDSADFTCVVRSEGGESVLVVQLKVLEMIiRRPTFRNPFNEKV 
VAQVGKP VAMNCSVDGNPTPB 1 1 W I LPDGTQFANGPQNS P YLMASNGSL I VYKATRNKSG 
KYRCTARNKVGYIEKLILLEIGQKPVILTYEPGMIKSAGGESLSLHCVSDGIPKPNVKWT
TPGGLVI DRPQ VGGKY I IiHENGTLVIKBTTAHDRGNY I CKAQNS VGQAVI SVPVTI VAYP 
PRIINYLPRSMLRRTGEAMQIJiCVALGVPKPQITWETPGySLI#STATERRPHRSEMLPIjQ 
GTLVIQNLRASDSGVYKCRAQKVLGADYATTYIQVL 

SEQ ID NO: 5

gi|33355470|gb|AY273816.1| Rattus norvegicus bone specific CMF608 mRNA, complete cds 

1 cgagagacga cagaaggtta cggctgcgag aagacgacag aagggtccag aaaaaggaaa 
61 gtgctggagg ggagtgggga caaaagcagc gaccaagtga atgtcacttc agtgactgag 
121 gccaggcaaa acgcgcggga aggattttgt gtagcttggg accctttcat agacactgat 
181 gacacgttta cgcaaaatag aaatttgagg agaaacgcct gggccttcgg aaaggagtga 
241 ttgattagta cttgcaagtt taggtgactt taaggagaac taactaatgt atactattga 
301 gggaggagga agagcattac agagtttcca gcagcagcag gaaagctttg gttaatttgg 
361 aaatggatga tagcattaaa ataacagaag cgcctccagg tctctgaagc ttcagtcccc 
421 cagctgaaag ccagaaaaga ctaagcccac taagcctttt gatccctttg gaagcaaaga 
481 actttccttc cctggggtga agactctcct cagaagattt cctgtctctg cctatgttac 
541 aagaggaatc aaaaccaaga cagaagagct caggatgcag gtgagaggca gggaagtcag 
601 cggcttgttg atctccctca ctgctgtctg cctggtggtc acccctggga gcagggcctg 
661 tcctcgccgc tgtgcctgct atgtgcccac agaggtgcac tgtacatttc ggtacctgac 
721 ctccatccca gatggcatcc cggccaatgt ggaacgaata aatttaggat ataacagcct 
781 tactagattg acagaaaacg actttgatgg cctgagcaaa ctggagttac tcatgctgca 
841 cagtaatggc attcacagag tcagtgacaa gaccttctcg ggcttgcagt ccttgcaggt 
901 cttaaaaatg agctataaca aagtccaaat cattcggaag gatactttct acggactcgg 
961 gagcttggtc cggttgcacc tggatcacaa caacattgaa ttcatcaacc ctgaggcctt 
1021 ttatggactt acctcgctcc gcttggtaca tttagaagga aaccggctca caaagctcca 
1081 tccagacaca tttgtctcat taagctatct ccagatattt aaaacctctt tcattaagta 
1141 cctgttcttg tctgataact tcctgacctc cctcccaaaa gaaatggtct cctacatgcc 
1201 aaacctagaa agcctgtatt tgcatggaaa cccatggacc tgtgactgcc atttaaagtg 
1261 gttgtctgag tggatgcagg gaaacccaga tataataaaa tgcaagaaag acagaagctc 
1321 ttccagtcct cagcaatgtc ccctttgcat gaaccccagg atctctaaag gcagaccctt 
1381 tgctatggta ccatctggag ctttcctatg tacaaagcca accattgatc catcactgaa
1441 gtcaaagagc ctggttactc aggaggacaa tggatctgcc tccacctcac ctcaagattt 
1501 catagaaccc tttggctcct tgtctttgaa catgacagac ctgtctggaa ataaggccga 
1561 catggtctgt agtatccaaa agccatcaag gacatcacca actgcattca ctgaagaaaa 
1621 tgactacatc atgctaaatg cgtcattttc cacaaatctt gtgtgcagtg tagattataa 
1681 tcacatccag ccagtgtggc aacttctggc tttatacagt gactctcctc tgatactaga 
1741 aaggaagccc cagcttaccg agactccttc actgtcttct agatataaac aggtggctct
1801 taggcctgaa gacattttta ccagcataga ggctgatgtc agagcagacc ctttttggtt 
1861 ccaacaagaa aaaattgtct tgcagctgaa cagaactgcc accacactta gcacattaca 
1921 gatccagttt tccactgatg ctcaaatcgc tttaccaagg gcggagatga gagcggagag 
1981 actcaaatgg accatgatcc tgatgatgaa caatcccaaa ctggaacgca ctgtcctggt 
2041 tggcggcact attgccctga gctgtccagg caaaggcgac ccttcacctc acttggaatg 
2101 gcttctagct gatgggagta aagtgagagc cccttacgtt agcgaggatg ggcgaatcct 
2161 aatagacaaa aatgggaagt tggaactgca gatggctgac agctttgatg caggtcttta 
2221 ccactgcata agcaccaatg atgcagatgc ggatgttctc acatacagga taactgtggt 
2281 agagccctat ggagaaagca cacatgacag tggagtccag cacacagtgg ttacgggtga 
2341 gacgctcgac cttccatgcc tttccacggg tgttccagat gcttctatta gctggattct 
2401 tccagggaac actgtgttct ctcagccatc aagagacagg caaattctta acaatgggac 
2461 cttaagaata ttacaggtta cgccaaaaga tcaaggtcat taccaatgtg tggctgccaa 
2521 cccatcaggg gccgactttt ccagttttaa agtttcagtt caaaagaaag gccaaaggat
2581 ggttgagcat gacagggagg caggtggatc tggacttgga gaacccaact ccagtgtttc 
2641 ccttaagcag ccagcatctt tgaaactctc tgcatcagct ttgacagggt cagaggctgg 
2701 aaaacaagtc tccggtgtac ataggaagaa caaacataga gacttaatac atcggcggcg 
2761 tggggattcc acgctccggc gattcaggga gcataggagg cagctccctc tctctgctcg 
2821 gagaattgac ccgcaacgct gggcagcact tctagaaaaa gccaaaaaga attctgtgcc 
2881 aaaaaagcaa gaaaatacca cagtaaagcc agtgccactg gctgttcccc tcgtggaact 
2941 cactgacgag gaaaaggatg cctctggcat gattcctcca gatgaagaat tcatggttct 
3001 gaaaactaag gcttctggtg tcccaggaag gtcaccaact gctgactctg gaccagtaaa 
3061 tcatggtttt atgacgagta tagcttctgg cacagaagtc tcaactgtga atccacaaac
3121 actacaatct gagcaccttc ctgatttcaa attatttagt gtaacaaacg gtacagctgt 
3181 gacaaagagt atgaacccat ccatagcaag caaaatagaa gatacaacca accaaaaccc 
3241 aatcattatc tttccatcag tagctgaaat tcgagattct gctcaggcag gaagagcatc 
3301 ttcccaaagt gcacaccctg taacaggggg aaacatggct acctatggcc ataccaacac
3361 atatagtagc tttaccagca aagccagtac agtcttgcag ccaataaatc caacagaaag 
3421 ttatggacct cagataccta ttacaggagt cagcagacct agcagtagtg acatctcttc 
3481 tcacactact gcagacccta gcttctccag tcacccttca ggttcacaca ccactgcctc 
3541 gtctttattt cacattccta gaaacaacaa tacaggtaac ttccccttgt ccaggcactt 

3601 gggaagagag aggacaattt ggagcagagg gagagttaaa aacccacata gaaccccagt
3661 tctccgacgg catagacaca ggactgtgag gccagcaatc aagggacctg ctaacaaaaa 
3721 tgtgagccaa gttccagcca cagagtaccc tgggatgtgc cacacatgtc cttccgcaga 
3781 ggggctcaca gtggctactg cagcactgtc agttccaagt tcatcccaca gtgccctccc 
3841 caaaactaat aatgttgggg tcatagcaga agagtctacc actgtggtca agaaaccact 

3901 gttactattt aaggacaaac aaaatgtaga tattgagata ataacaacca ctacaaaata
3961 ttccggaggg gaaagtaacc acgtgattcc tacggaagca agcatgactt ctgctccaac 
4021 atctgtatcc ctggggaaat ctcctgtaga caatagtggt cacctgagca tgcctgggac 
4081 catccaaact gggaaagatt cagtggaaac aacaccactt cccagccccc tcagcacacc 
4141 ctcaatacca acaagcacaa aattctcaaa gaggaaaact cccttgcacc agatctttgt 

4201 aaataaccag aagaaggagg ggatgttaaa gaatccatat caattcggtt tacaaaagaa
4261 cccagccgca aagcttccca aaatagctcc tcttttaccc acaggtcaga gttccccctc 
4321 agattctaca actctcttga caagtccgcc accagctctg tctacaacaa tggctgccac 
4381 tcagaacaag ggcactgaag tagtatcagg tgccagaagt ctctcagcag ggaagaagca 
4441 gcccttcacc aactcctctc cagtgcttcc tagcaccata agcaagagat ctaatacatt 

4501 aaacttcttg tcaacggaaa cccccacagt gacaagtcct actgctactg catctgtcat
4561 tatgtctgaa acccaacgaa caagatccaa agaagcaaaa gaccaaataa aggggcctcg 
4621 gaagaacaga aacaacgcaa acaccacccc caggcaggtt tctggctata gtgcatactc 
4681 agctctaaca acagctgata cccccttggc tttcagtcat tccccacgac aagatgatgg 
4741 tggaaatgta agtgcagttg cttatcactc aacaacctct cttctggcca taactgaact 

4801 gtttgagaag tacacccaga ctttgggaaa tacaacagct ttggaaacaa cgttgttgag
4861 caaatcacag gagagtacca cagtgaaaag agcctcagac acaccaccac cactcctcag 
4921 cagtggggcg cccccagtgc ccactccttc cccacctcct tttactaagg gtgtggttac 
4981 agacagcaaa gtcacatcag ctttccagat gacgtcaaat agagtggtca ccatatatga
5041 atcttcaagg cacaatacag atctgcagca accctcagca gaggctagcc ccaatcctga 

5101 gatcataact ggaaccactg actctccctc taatctgttt ccatccactt ctgtgccagc
5161 actaagggta gataaaccac agaattctaa atggaagccc tctccctggc cagaacacaa 
5221 atatcagctc aagtcatact ccgaaaccat tgagaagggc aaaaggccag cagtaagcat
5281 gtccccccac ctcagccttc cagaggccag cactcatgcc tcacactgga atacacagaa
5341 gcatgcagaa aagagtgttt ttgataagaa acctggtcaa aacccaactt ccaaacatct
5401 gccttacgtc tctctaccta agactctatt gaaaaagcca agaataattg gaggaaaggc
5461 tgcaagcttt acagttccag ctaattcaga cgtttttctt ccttgtgagg ctgttggaga 
5521 cccactgccc atcatccact ggaccagagt ttcatcagga cttgaaatat cccaagggac 
5581 acagaaaagc cggttccacg tgcttcccaa tggcaccttg tccatccaga gggtcagtat 
5641 tcaggaccgt ggacagtacc tgtgctctgc atttaatcca ctgggcgtag accattttca

5701 tgtctctttg tctgtggttt tttacccggc aaggattttg gacagacatg tcaaggagat
5761 cacagttcac tttggaagta ctgtggaact aaagtgcaga gtggagggta tgccgaggcc 
5821 tacggtttcc tggatacttg caaaccaaac ggtggtctca gaaacggcca agggaagcag
5881 aaaggtctgg gtaacacctg atggaacatt gatcatctat aatctgagtc tttatgatcg 
5941 tggtttttac aagtgtgtgg ccagcaaccc atctggccag gattcactgt tggttaagat 

6001 acaagtcatc acagctcccc ctgtcattat agagcaaaag aggcaagcca tcgttggggt
6061 tttaggtgga agtttgaaac tgccctgcac tgcaaaagga actccccagc ctagtgttca 
6121 ctgggtcctt tatgatggga ctgaactaaa accattgcag ttgactcatt ccagattttt
6181 cttgtatcca aatggaactc tgtatataag aagcatcgct ccttcagtga ggggcactta
6241 tgagtgcatt gccaccagct cctcaggctc agagagaagg gtagtgattc ttactgtgga 

6301 agagggagag acaatcccca ggatagaaac tgcctctcag aaatggactg aggtgaattt
6361 gggtgagaaa ttactactga actgctcagc tactggggat ccaaagccta gaataatctg 
6421 gaggctgcca tccaaggctg tcatcgacca gtggcacaga atgggcagcc gaatccacgt
6481 ctacccaaat ggatccttgg tggttgggtc agtgacggaa aaagacgctg gtgactactt 
6541 atgtgtggca agaaacaaaa tgggagatga cctagtcctg atgcatgtcc gcctgagatt

6601 gacacctgcc aaaattgaac agaagcagta ttttaagaag caagtgctcc atgggaaaga
6661 tttccaagtt gactgcaagg cctctggctc ccctgtgcct gaggtatcct ggagtttgcc
6721 tgatgggaca gtgctcaaca atgtagccca agctgatgac agtggctata ggaccaagag
6781 gtacaccctt ttccacaatg gaaccttgta tttcaacaac gttgggatgg cagaggaagg 
6841 agattatatc tgctctgccc agaacacctt agggaaagat gagatgaaag tccacctaac 
6901 agttctaaca gccatcccac ggataaggca aagctacaag accaccatga ggctcagggc 
6961 tggagaaaca gctgtccttg actgcgaggt cactggggaa ccgaagccca atgtattttg
7021 gttgctgcct tccaacaatg tcatttcatt ctccaatgac aggttcacat ttcatgccaa 
7081 tagaactttg tccatccata aagtgaaacc acttgactct ggggactatg tgtgcgtagc 
7141 tcagaatcct agtggggatg acactaagac atacaaactg gacattgtct ctaaacctcc 
7201 attaatcaat ggcctgtatg caaacaagac tgttattaaa gccacagcca ttcggcactc 

7261 caaaaaatac tttgactgca gagcagatgg gatcccatct tcccaggtca cgtggattat
7321 gccaggcaat attttcctcc cagctccata ctttggaagc agagtcacgg tccatccaaa
7381 tggaaccttg gagatgagga acatccggct ttctgactct gcggacttca cctgtgtggt 
7441 tcggagcgag ggaggagaga gtgtgttggt agtgcagtta gaagtcctag aaatgctgag 
7501 aagaccaaca ttcagaaacc cattcaacga aaaagtcatc gcccaagctg gcaagcccgt 

7561 agcactgaac tgctctgtgg atgggaaccc cccacctgaa attacctgga tcttacctga
7621 cggcacacag tttgctaaca gaccacacaa ttccccgtat ctgatggcag gcaatggctc 
7681 tctcatcctt tacaaagcaa ctcggaacaa gtcagggaag tatcgctgtg cagccaggaa 
7741 taaggttggc tacatcgaga aactcatcct gttagagatt gggcagaagc cagtcattct 
7801 gacatacgaa ccagggatgg tgaagagcgt cagtggggaa ccgttatcac tgcattgtgt 

7861 gtctgatggg atccccaagc caaatgtcaa gtggactaca ccgggtggcc atgtaatcga
7921 caggcctcaa gtggatggaa aatacatact gcatgaaaat ggcacgctgg tcatcaaagc 
7981 aacaacagct cacgaccaag gaaattatat ctgtagggct caaaacagtg ttggccaggc 
8041 agttattagc gtgtcagtga tggttgtggc ctaccctccc cgaatcataa actacctacc 
8101 caggaacatg ctcaggagga caggggaagc catgcagctc cactgtgtgg ccttgggaat 

8161 ccccaagcca aaagtcacct gggagacgcc aagacactcc ctgctctcaa aagcaacagc
8221 aagaaaaccc catagaagtg agatgcttca cccacaaggt acgctggtca ttcagaatct 
8281 ccaaacctcg gattccggag tctataagtg cagagctcag aacctacttg ggactgatta 
8341 cgcaacaact tacatccagg tactctgaca ggaaggggga gactaaaatt caacagaagt 
8401 ccacatccac agggtttatt ttttggaaga agtttaatca aaggcagcca taggcatgta 

8461 aatgagtctg aatacattta cagtattaaa tttacaatgg acatgcgatg agacttgtaa
8521 atgaaagcat tgtgaactga aaccgagtct ctgtggatct caaagcaaac tcttaactta 
8581 aggcactttg attttgccaa caaataataa caaacattaa gagaaaaaaa tgatccacta 
8641 cgaaataaca aacggctaat gcacctgaat tctcagtaaa aagacctttc tctcgctaac 
8701 agttgccagc tgcctcgtgt ctgtttccta ccaatgtcac aaacatcgca cacagggtga 

8761 atggagtcaa cgggaaagat taagtttgcg gtctgtgtaa atctcaatgt acaaatattc
8821 tgtcnctggt ttataaacat tttgataaaa ccgaaaaaaa aaaaaaaaaa aaaaaaaaaa 
8881 aaa 

SEQ ID NO: 6

Amino acid sequence of rat ADLICAN encoded by the DNA sequence shown in SEQ ID NO: 5.

MQVRGREVSGLLISLTAVCLVVTPGSRACPRRCACYVPTEVHCTFRYLTSIPDGIPANVE
RINLGYNSLTRLTENDFDQLSKLELLMLHSNGIHRVSDKTFSGIjQSLQVLKMSYNKVQII 
RKDTFYGLGSLVRLHLDHNNIEFINPEAFyGLTSLRLVHLEGNRLTKIiHPDTFVSLSYLQ 
I FKTSFI KYLFLSDNFLTSLPKBMVS YMPNLESLYLHGNPWTCDCHLKWLSEWMQGNPDI 

IKCKKDRSSSSPQQCPLCMNPRISKGRPFAMVPSGAFLCTKPTIDPSLKSKSLVTQEDNG
SASTSPQDFIEPFGSLSLNMTDLSGNKADMVCSIOKPSRTSPTAFTEENDYIMLNASFST 
NLVCS VDYNHI QP VWQLLALYSDS PL I LERKPQLTETPS LS S RYKQ VALR P EDI FTS X EA 
DVRADPFWFQQEKIVLQLNRTATTLSTLQIQFSTDAQIALPRAEMRAERLKWTO 
PKLERTVLVGGTIALSCPGKGDPSPHLEWLLADGSKVRAPYVSEDGRILIDKNGKLELQM 

ADSFDAGLYHCISTNDADADVLTYRITVVEPYGESTHDSGVQHTVVTGETLDLPCLSTGV
PDASISWILPGNTVFSQPSRDRQILNNGTLRILQVTPKDQGHYQCVAANPSGADFSSFKV 
SVQKKGQRMVEHDREAGGSGLGEPNSSVSLKQPASLKLSASALTGSEAGKQVSGVHRKNK 
HRDLIHRRRGDSTLRRFREHRRQLPLSARRIDPQRWAALLEKAKKNSVPKKQENTTVKPV 
PLAVPLVELTDEEKDASGMIPPDEEFMVLKTKASGVPGRSPTADSGPVNHGFMTSIASGT 

EVSTVNPQTLQSEHLPDFKLFSVTNGTANrTK^

DSAQAGRASSQSAHPVTGGNMATYGHTNTYSSFTSKASTVLQPINPTESYGPQIPITGVS 
RPSSSDISSHTTADPSFSSHPSGSHTTASSLFHIPRNNNTGNFPLSRHLGRERTIWSRGR 
\HCOTHRTPVLRIUiRHRTVRPAIKGPANKNVSQVPA
PSSSHSALPKTNNVGVIABESTTWKKPI^LFra^ 

EASMTS APTS VSLGKS PVDNSGHLSMPGTIQTGKDS VETTPLPS PLSTPS I PTSTKFSKR 
KTPLHQIFVNNQKKEGMLKNPYQFGLQKNPAAKLPKIAPLLPTGQSSPSDSTTLLTSPPP 
ALSTTMAATQNKGTEVVSGARSLSAGKKQPFTNSSPVLPSTISKRSNTLNFLSTETPTVT
SPTATASVIMSETQRTRSKEAKDQIKGPRKNRNNANTTPRQVSGYSAySALTTADTPLAF 
SHSPRQDDGGNVSAVAYHSTTSLIAITELFEKYTQTLGNTTALETTLLSKSQE 
SDTPPPLLSSGAPPVPTPSPPPFTKGVVTDSKVTSAFQMT^^ 

SAEASPNPBIITGTTDSPSNLFPSTSVPALRVDKPQNSKWKPSPWPEHKYQLKSYSETIE 

KGKRPAVSMSPHLSLPEASTHASHWNTQKHAEKSVFDKKPGQNPTSKHLPYVSLPKTLLK
KPRI IGGKAASFTVPANSDVFLPCEAVGDPLP I IHWTRVSSGLEISQGTQKSRFHVLPNG 
TLSIQRVSIQDRGQYLCSAFNPLGVDHFHVSLSWFYPARILDRHVKEITVHFGSTVELK 
CRVEGMPRPTVSW ILANQTWSETAKGS RKVWVTPDGTLI I YNLSLYDRG FYKCVASNPS 
GQDSLLVKIQVITAPPVIIBQKRQAIVGVLGGSLKLPCTAKGTPQPSVHWVLYDGTBIiKP 

LQLTHSRFFLYPNGTLYIRSIAPSVRGTYECIATSSSGSERRVVILTVEEGETIPRIETA
SQKWTEVin^EKLLLNCSATGDPKPRIIWRLPSKAVIDQWHRMGSRIHVYPNGSLVVGSV 
TEKDAGD YLCVARNKMGDDLVLMHVRLRLTP AKI EQKQYFKKQVLHGKDFQVDCKASGS P 
VPEVS WS LPDGTVLNNV AQADDS G YRTKR YTLFHNGTLYFNNVGMAEEGD Y I CS AQNTLG 
IO>EMKVHLTVLTAIPRIRQSYKTTMRLRAGET^^ 

NDRFTFHANRTLSIHKVKPLDSGDYVCVAQNPSGDDTKTYKLDIVSKPPLINGLYANKTV
IKATAIRHSKKYFDCRAIX3IPSSQVTWIMPGNIFLPAPYFGSRVTVHPNGTLEMRNIRLS 
DSADFTCVVRSEGGESVLVVQLEVLEMLRRPTFRNPFNEKVIAQAGKPVALNCSVDGNPP 
PE ITW I LPDGTQFANRPHNS P YLMAGNGSfcl L YKATRNKSGKYRCAARNKVG YI EKLI LL 
EIGQKPVILTYEPGMVKSVSGBPLSLHCVSDGIPKPNVKWTTPGGHVIDRPQVDGKYILH 

ENGTLVIKATTAHDQGNYICRAQNSVGQAVISVSVMVVAYPPRIINYLPRNMLRRTGEAM
QLHCVALGIPKPKVTWETPRHSLLSKATARKPHRSEMU1PQGTLVIQNLQTSDSGVYKCR 
AQNLLGTDYATTYIQVL 

SEQ ID NO: 7

gi|4502040|ref|NM_000693.1| Homo sapiens aldehyde dehydrogenase 1 family, member A3
(ALDH1A3), mRNA

1 agccggtgcg ccgcagacta gggcgcctcg ggccagggag cgcggaggag ccatggccac 
61 cgctaacggg gccgtggaaa acgggcagcc ggacgggaag ccgccggccc tgccgcgccc 
121 catccgcaac ctggaggtca agttcaccaa gatatttatc aacaatgaat ggcacgaatc 
181 caagagtggg aaaaagtttg ctacatgtaa cccttcaact cgggagcaaa tatgtgaagt 
241 ggaagaagga gataagcccg acgtggacaa ggctgtggag gctgcacagg ttgccttcca
301 gaggggctcg ccatggcgcc ggctggatgc cctgagtcgt gggcggctgc tgcaccagct 
361 ggctgacctg gtggagaggg accgcgccac cttggccgcc ctggagacga tggatacagg 
421 gaagccattt cttcatgctt ttttcatcga cctggagggc tgtattagaa ccctcagata 
481 ctttgcaggg tgggcagaca aaatccaggg caagaccatc cccacagatg acaacgtcgt
541 atgcttcacc aggcatgagc ccattggtgt ctgtggggcc atcactccat ggaacttccc
601 cctgctgatg ctggtgtgga agctggcacc cgccctctgc tgtgggaaca ccatggtcct 
661 gaagcctgcg gagcagacac ctctcaccgc cctttatctc ggctctctga tcaaagaggc 
721 cgggttccct ccaggagtgg tgaacattgt gccaggattc gggcccacag tgggagcagc 
781 aatttcttct caccctcaga tcaacaagat cgccttcacc ggctccacag aggttggaaa 
841 actggttaaa gaagctgcgt cccggagcaa tctgaagcgg gtgacgctgg agctgggggg
901 gaagaacccc tgcatcgtgt gtgcggacgc tgacttggac ttggcagtgg agtgtgccca 
961 tcagggagtg ttcttcaacc aaggccagtg ttgcacggca gcctccaggg tgttcgtgga 
1021 ggagcaggtc tactctgagt ttgtcaggcg gagcgtggag tatgccaaga aacggcccgt 
1081 gggagacccc ttcgatgtca aaacagaaca ggggcctcag attgatcaaa agcagttcga 
1141 caaaatctta gagctgatcg agagtgggaa gaaggaaggg gccaagctgg aatgcggggg
1201 ctcagccatg gaagacaagg ggct/rttcat caaacccact gtcttctcag aagtcacaga 
1261 caacatgcgg attgccaaag aggagatttt cgggccagtg caaccaatac tgaagttcaa 
1321 aagtatcgaa gaagtgataa aaagagcgaa tagcaccgac tatggactca cagcagccgt 
1381 gttcacaaaa aatctcgaca aagccctgaa gttggcttct gccttagagt ctggaacggt 
1441 ctggatcaac tgctacaacg ccctctatgc acaggctcca tttggtggct ttaaaatgtc
1501 aggaaatggc agagaactag gtgaatacgc tttggccgaa tacacagaag tgaaaactgt 
1561 caccatcaaa cttggcgaca agaacccctg aaggaaaggc ggggctcctt cctcaaacat 
1621 cggacggcgg aatgtggcag atgaaatgtg ctggaggaaa aaaatgacat ttctgacctt
1681 cccgggacac attcttctgg aggctttaca tctactggag ttgaatgatt gctgttttcc 
1741 tctcactctc ctgtttattc accagactgg ggatgcctat aggttgtctg tgaaatcgca 
1801 gtcctgcctg gggagggagc tgttggccat ttctgtgttt ccctttaaac cagatcctgg 
1861 agacagtgag atactcaggg cgttgttaac agggagtggt atttgaagtg tccagcagtt
1921 gcttgaaatg ctttgccgaa tctgactcca gtaagaatgt gggaaaaccc cctgtgtgtt 
1981 ctgcaagcag ggctcttgca ccagcggtct cctcagggtg gacctgctta cagagcaagc 
2041 cacgcctctt tccgaggtga aggtgggacc attccttggg aaaggattca cagtaaggtt 
2101 ttttggtttt tgttttttgt tttcttgttt ttaaaaaaag gatttcacag tgagaaagtt 

2161 ttggttagtg cataccgtgg aagggcgcca gggtctttgt ggattgcatg ttgacattga
2221 ccgtgagatt cggcttcaaa ccaatactgc ctttggaata tgacagaatc aatagcccag 
2281 agagcttagt caaagacgat atcacggtct accttaacca aggcactttc ttaagcagaa 
2341 aatattgttg aggttacctt tgctgctaaa gatccaatct tctaacgcca caacagcata 
2401 gcaaatccta ggataattca cctcctcatt tgacaaatca gagctgtaat tcactttaac 

2461 aaattacgca tttctatcac gttcactaac agcttatgat aagtctgtgt agtcttcctt
2521 ttctccagtt ctgttaccca atttagatta gtaaagcgta cacaactgga aagactgctg 
2581 taataacaca gccttgttat ttttaagtcc tattttgata ttaatttctg attagttagt 
2641 aaataacacc tggattctat ggaggacctc ggtcttcatc caagtggcct gagtatttca 
2701 ctggcaggtt gtgaattttt cttttcctct ttgggaatcc aaatgatgat gtgcaatttc 

2761 atgttttaac ttgggaaact gaaagtgttc ccatatagct tcaaaaacaa aaacaaatgt
2821 gttatccgac ggatactttt atggttacta actagtactt tcctaattgg gaaagtagtg 
2881 cttaagtttg caaattaagt tggggagggc aataataaaa tgagggcccg taacagaacc 
2941 agtgtgtgta taacgaaaac catgtataaa atgggcctat cacccttgtc agagatataa 
3001 attaccacat ttggcttccc ttcatcagct aacacttatc acttatacta ccaataactt 

3061 gttaaatcag gatttggctt catacactga attttcagta ttttatctca agtagatata
3121 gacactaacc ttgatagtga tacgttagag ggttcctatt cttccattgt acgataatgt 
3181 ctttaatatg aaatgctaca ttatttataa ttggtagagt tattgtatct ttttatagtt 
3241 gtaagtacac agaggtggta tatttaaact tctgtaatat actgtattta gaaatggaaa 
3301 tatatatagt gttaggtttc acttctttta aggtttaccc ctgtggtgtg gtttaaaaat 

3361 ctataggcct gggaattccg atcctagctg cagatcgcat cccacaatgc gagaatgata
3421 aaataaaatt ggatatttga ga 

SEQ ID NO: 8

Amino acid sequence of human ALDH1A3 encoded by the DNA sequence shown in SEQ ID 
NO: 7. 

MATANGAVENGQPDGKPPALPRPIRNLEVKFTKIP1NNEWHESKSGKKFATCNPSTREQI
CEVEEGD KPDVDKAVEAAQVAFQRG S PWRRLDALS RGRLLHQLADL VERDRATLAALETM 
DTGKPFLHAFPIDLEGCIRTLRYPAGWADKIQGKTIPTDDNVVCPTRHEPIGVCGAITPW 
NFPLLMLVWKLAPALCCGNTMVLKPAEQTPLTALYLGSLIKEAGPPPGVVNIVPGPGPTV 
GAAISSHPQINKIAFTGSTEVGKLVKEAASRSNLKRVTLEI^KNPCIVC^ADLDLAV^ 

CAHQGVFPNQGQCCTAASRVFVEEQVYSEFVRRSVEYAKKRPVGDPPDVKTEQGPQIDQK
QFDKILELIESGKKEGAKLECGGSAMEDKGLF1KPTVFSEVTDNMRIAKEEIFGPVQPIL 
KFKSIEEVIKRANSTDYGLTAAVPTKNLDKALKLASALESGTVWINCYNALYAQAPFGGF 
KMSGNGRELGEYALAEYTE VKTVTI KLGDKNP 

SEQ ID NO: 9 

gi|9295527|gb|AF280404.1|AF280404 Mus musculus retinaldehyde dehydrogenase 3
(Raldh3) mRNA, complete cds 

1 gagagtgcga accagttatg gctaccacca acggggctgt ggaaaacgga cagccggatg 

61 ggaaaccgcc tgccttgccg cgccccatcc gcaacttgga ggtcaagttc accaagatat 

121 ttatcaacaa cgactggcac gaatccaaga gtggaagaaa gtttgccaca tataaccctt 

181 caacactaga gaaaatatgt gaggtggaag aaggagataa gcccgatgtg gacaaggctg

241 tggaggccgc tcaagctgcc ttccagcggg gatccccgtg gcgccggctg gatgcactga 

301 gcagaggcca gttgctgcat cagctggctg accttgtaga aagggaccgt gcgatcctgg 
361 ctactctgga gaccatggac accggcaagc cattccttca tgcctttttc gtcgacctgg

421 aaggctgtat taagaccttc agatattttg ccgggtgggc agacaaaatc cagggcagga 

481 ccatccccac agatgacaac gttgtgtgct tcaccaggca tgagcccatc ggggtgtgtg 

541 gggccattac accatggaac ttccccctgc tgatgctggc ctggaaactg gctcctgccc 

601 tgtgctgtgg gaacaccgtg gtcctgaagc cagctgagca gacccctctc acggctctgt

661 acctagcctc tctcatcaaa gaggtcgggt tccctccggg tgtggtgaac attgtaccag 

721 gctttgggcc cactgtggga gcagcaattt cctcccatcc gcagatcaac aagatagcct 

781 tcaccggctc cacagaggtt ggaaagctgg tcagagaagc cgcctcccgg agcaacctga 

841 agagggtcac actggagcta ggaggcaaga atccgtgcat cgtgtgtgca gatgctgact

901 tggacttggc cgtcgagtgt gctcaccagg gagtgttctt caaccaaggt cagtgctgta

961 cagcggcctc cagggtgttc gtggaagagc aggtctacgg ggagtttgtg aggaggagtg

1021 tggagttcgc caagaagagg ccggttggag accccttcga tgccaaaacg gagcaggggc 

1081 ctcagatcga ccaaaagcag tttgacaaaa tcctcgagct gattgagagt gggaagaagg 

1141 aaggggccaa gctagaatgt ggggggtcag ccatggagga cagagggctg ttcatcaaac 

1201 ccacggtctt ctcagatgtt acggacaaca tgaggattgc caaagaggag attttcggac

1261 cagtgcagcc gatcctgaag ttcaaaaacc tggaggaggt gatcaagaga gcgaatagca 

1321 ccgactatgg actcacagca gcagtgttca ccaaaaacct ggacaaagca ctgaagctgg 

1381 ctgctgcgct cgagtcgggg acagtctgga tcaactgcta caatgcattt tatgcacagg 

1441 ctccatttgg tggcttcaaa atgtctggga atggcagaga actaggagaa tatgctctgg 

1501 ctgaatatac agaagtgaaa accgtcacca tcaaactcga ggagaagaac ccctgaggaa

1561 cag 

SEQ ID NO: 10 

Amino acid sequence of mouse ALDH1A3 encoded by the DNA sequence shown in SEQ ID 
NO: 9. 

MATTNG A VENGQPDG KPPALPRP I RNLE VKFTKI P I NND WH ES KSGRKFAT YN PSTLEKI
CBVEEQDKPDVDKAVEAAOAAFQRGSPWRRLDALSRGQLLHQLADLVERDRAILATLETM 
DTGKPFLHAPFVDLEGCIKTFRYFAGWADKIQGRTIPTDDNWCFTRHEPIGVCGA1TPW 
NFPLLMLAWKLAPALCCGNTVVLKPAEQTPLTALYLASLIKEVGFPPGVVNIVPGFGPTV 
GAAISSHPQINKIAFTGSTEVGKLVREAASRSNLKRVTLELGGKNPCIVCADADLDLiAVE 

CAHQGVFFNQGQCCTAASRVFVEEQVYGEFVRRSVEFAKKRPVGDPFDAKTEQGPOIDQK
QFDKILELI ESGKKEGAKLECGGS AMEDRGLFI KPTVFSDVTDNMRI AKEE I FGPVQP IL 
KFKNLEEVI KRANSTD YGLTAAVFTKNLDKALKLAAALESGTVW INC YNAF YAQAP FGGF 
KMSGNGRELGEYALAE YTEVKTVTI KLEEKNP 

SEQ ID NO: 11 

gi|23463282|ref|NM_153300.1| Rattus norvegicus aldehyde dehydrogenase family 1,
subfamily A3 (Aldh1a3), mRNA

1 atggccaccg ctaacggggc cgtggaaaac ggacagcctg atgggaaacc gcctgccttg 
61 ccgcgcccca tccgcaactt ggaggtcaag ttcactaaga tatttatcaa caatgactgg 
121 cacgaaccca agagtggaag aaagtttgcc acatataacc cttcaactct agagaaaata 

181 tgtgaggtgg aagaaggcga taagcccgat gtggacaagg ccgtggaggc cgcccaagct
241 gccttccaga ggggctcccc gtggcgcagg ctggatgccc tgagtcgtgg ccagttgttg 
301 caccagttgg ctgatcttat agaaagggac cgcgctatcc tggcaactct agagaccatg 
361 gacaccggca agccgttcct tcatgccttt ttcgtcgacc tggagggctg tattaagacc 
421 ttcagatatt ttgccgggtg ggcagacaaa atccagggca ggaccatccc cacagatgat 

481 aatgtcatgt gcttcaccag gcatgagccc attggggtgt gtggggccat tacaccatgg
541 aacttccccc tgttgatgct ggcctggaaa ctggctcctg ccctgtgctg cgggaacacc 
601 gtggtcctga agccagctga gcagacgcct ctcactqccc tgtacctcgc ttctctcatc 
661 aaagaggtcg ggttccctcc aggtgtggtg aacattgtgc caggctttgg gcccacagtg 
721 ggagcagcaa tctcctctca tccacagatc aacaagatag ccttcaccgg ctccacagag 

781 gttggaaagc tggttaaaga agctgcctcc aggagcaacc tgaagcgggt cacactggag
841 ctgggaggca ggaacccgtg catcgtgtgt gcggacgctg acctggactt ggctgtggag 
901 tgtgctcacc agggagtgtt cttcaaccaa ggccagtgct gcacagcggc ctccagggtg 
961 tttgtggaag agcaggtcta cggggagttt gtgaggagga gtgtggagtt cgccaagaag

1021 aggccagttg gagacccctt cgatgccaaa acggagcagg ggcctcagat cgaccaaaag 

1081 cagtttgaca aaatcctcga gctgatcgag agtgggaaga aggaaggggc caagctagaa 

1141 tgtggggggt cagccatgga ggacagaggg ctgttcatca aacccaccgt cttctcagac 

1201 gttacggaca acatgaggat tgccaaagag gagatttttg gaccagtgca accaatactg

1261 aagttcaaaa acctggagga agtaatcaaa agagcgaaca gcactgacta tgggctcacg 

1321 gcggccgtgt tcaccaaaaa cctcgacaaa gccctgaagc tggcttccgc gctcgagtcg 

1381 gggacagtct gggtcaactg ctacaacgca ttctatgcac aggctccatt tggtggcttc 

1441 aaaatgtctg gaaatggcag agaactaggt gaatacgctt tggctgaata cacagaagtg 

1501 aaaactgtca ccatcaaact tgatgagaag aacccctga

SEQ ID NO: 12 

Amino acid sequence of rat ALDH1A3 encoded by the DNA sequence shown in SEQ ID
NO: 11.

MATANGAVENGQPIXSKPPALPRPIIWLBVKFTKIFIl^WHEPKSGRKPATYNPSTLEKI 
CEVEEGDKPDVDKAVEAAQAAFQRGSPWRRLDALSRGQLLHQLADL I ERDRAILATLETM
DTGKPFLHAFFVDLEGCIKTFRYFAGWADKIQGRTIPTDDNVMCPTRHEPIGVCGAITPW 
NFPLLMLAWKLAPALCCGNTWLKPAEQTPLTALYLAS L I KEVG FPPG WNI VPGFGPTV 
GAAISSHPQINKIAFTGSTEVGKLVKEAASRSNLKRVTLELGGRNPCIVCADADLDLAVE 
CAHQGVFFNQGQCCTAASRVFVEEQVYGEFVRRSVEFAKKRPVGDPFDAKTEQGPQIDQK 
QFDKI LELI ESGKKEGAKLECGGS AMEDRGLFIKPTVFSDVTDNMR I AKEEI FGPVQPIL
KFKNLEEVIKRANSTDYGLTAAVFTKNLDKAIiKLASALESGTVWVN 
KMSGNGRELGEYAL7VEYTEVKTVTIKLDEKNP 

SEQ ID NO: 13 

gi|21687097|ref|NM_145259.1| Homo sapiens activin A receptor, type IC (ACVR1C),
mRNA

1 ggtcaccgcc cggctgcggg gccagtggca ggagcgccac gcaccgccag ccgcaggggg 
61 cgtgggatgg gggcggccgg ggaggggggc gcccacactg actagagcca accgcgcact 
121 tcaaaagggt gtcggtgccg cgctcccctc ccgcggcccg ggaacttcaa agcgggccgt 
181 gctgccccgg ctgcctcgct ctgctctggg gcctcgcagc cccggcgcgg ccgcctggtg 
241 gcgatgaccc gggcgctctg ctcagcgctc cgccaggctc tcctgctgct cgcagcggcc 
301 gccgagctct cgccaggact gaagtgtgta tgtcttttgt gtgattcttc aaactttacc 
361 tgccaaacag aaggagcatg ttgggcatca gtcatgctaa ccaatggaaa agagcaggtg 
421 atcaaatcct gtgtctccct tccagaactg aatgctcaag tcttctgtca tagttccaac 
481 aatgttacca aaaccgaatg ctgcttcaca gatttttgca acaacataac actgcacctt 
541 ccaacagcat caccaaatgc cccaaaactt ggacccatgg agctggccat cattattact 
601 gtgcctgttt gcctcctgtc catagctgcg atgctgacag tatgggcatg ccagggtcga 
661 cagtgctcct acaggaagaa aaagagacca aatgtggagg aaccactctc tgagtgcaat 
721 ctggtaaatg ctggaaaaac tctgaaagat ctgatttatg atgtgaccgc ctctggatct 
781 ggctctggtc tacctctgtt ggttcaaagg acaattgcaa ggacgattgt gcttcaggaa 
841 atagtaggaa aaggtagatt tggtgaggtg tggcatggaa gatggtgtgg ggaagatgtg 
901 gctgtgaaaa tattctcctc cagagatgaa agatattggt ttcgtgaggc agaaatttac 
961 cagacggtca tgctgcgaca tgaaaacatc cttggtttca ttgctgctga caacaaagat 
1021 aatggaactt ggactcaact ttggctggta tctgaatatc atgaacaggg ctccttatat 
1081 gactatttga atagaaatat agtgaccatg gctggaatga tcaagctggc gctctcaatt 
1141 gctagtggtc tggcacacct tcatatggag attgttggta cacaaggtaa acctgctatt 
1201 gctcatcgag acataaaatc aaagaatatc ttagtaaaaa agtgtgaaac ttgtgccata 
1261 gcggacttag ggttggctgt gaagcatgat tcaatactga acaccatcga catacctcag 
1321 aatcctaaag tgggaaccaa gaggtatatg gctcctgaaa tgcttgatga tacaatgaat 
1381 gtgaatatct ttgagtcctt caaacgagct gacatctatt ctgttggtct ggtttactgg 
1441 gaaatagccc ggaggtgttc agtcggagga attgttgagg agtaccaatt gccttattat 
1501 gacatggtgc cttcagatcc ctcgatagag gaaatgagaa aggttgtttg tgaccagaag 
1561 tttcgaccaa gtatcccaaa ccagtggcaa agttgtgaag cactccgagt catggggaga 
1621 ataatgcgtg agtgttggta tgccaacgga gcggcccgcc taactgctct tcgtattaag

1681 aagactatat ctcaactttg tgtcaaagaa gactgcaaag cctaatgatg ataattatgt 

1741 taaaaagaaa tctctcatag ctttcttttc cattttcccc tttatgtgaa tgtttttgcc 

1801 attttttttt tgttctacct caaagataag acagtacagt atttaagtgc ccataaggca 

1861 gcatgaaaag ataactctaa agttaagcat gggcaggagt tgacttcatc caatctctat

1921 gttatgttta attttatttt gaaagcaaca cctcaactca tctttttatt taataaggaa 

1981 gaaatatatt acaaaagtat aaaataagct ctataaaaat gttatagtca ttaagttttt 

2041 attttacttg aaccaagagc acatgaatga acaggaaaag atgtaaaaac atttttttct 

2101 gagatgaaaa catattaatt aaacatgcaa attagagcat gctatcttta ggtgatgcaa 

2161 tctatgtttc ccccttttta agttagcagg actttttaaa aataaatatt gctctaaact

2221 ttaatatatc gaacgtgaga gtggagctgc ttagtggaag atgtaagtga ggtgggtgtc 

2281 ccatgtgctt ggtctcccct tctgctgttc tcctgttctt cataatccac tactgcagca 

2341 gtccctgaac cactaaactt gttcctttca tttacaaaag agatacctga catcctgaga 

2401 cactgagaaa tgtcctgaag tcacacagct aatggcagaa ctggcactag gtccaaatct 

2461 tgtgataatg aacaccgtaa ggttagctag cttcctactt tcccttgaat agtgcttttc

2521 tccctatgta atatctttta ttatgatatt tgtggtttag aaggcatatt gagttatttt 

2581 gcagaatcat aatggacccg cacaaaatct cagaaccata tctgttgaca ttttttctca 

2641 tagaaatatc atggttaccc catttgttaa tgagcattaa tgttttctga acacttccaa 

2701 agattaatca aacataaata ttcattgtct gaaaatgtct ttaagataca attcagaggt 

2761 ccctatttcc tttgtacata cacacttaga aagaaaagac agaaaaggaa gaggaaggaa

2821 ggaaatattt tgagaatata ttgagaagaa ttaagaaaac tcttcaatga agtgttaaca 

2881 accaaaccct acagacggta tcagaaacag caaatagata ttcctctacc ctttcacagt 

2941 gagtgagtga gtacagaaga atgctcatga tagttttgcc ttcattctac tttctgtgga 

3001 cacagagtaa tgaatattta atgggacatt aaatatgccc ttcaaatcta taattttact 

3061 ttggtaaacg agatttaaca tgatgtcttt tatgctccta aaacatcttt tttcaaactc

3121 cattccttag aacattcttc tactgagatg atccaagacc aaaagtgttc tttggtactt 

3181 gcttataaag tgatagtaca tgttagcata taatgtattt tgaagagtga agtaaatgct 
3241 attgataaca gaaaaaaaaa aaaaaaa 

SEQ ID NO: 14 

Amino acid sequence of human ALK7 encoded by the DNA sequence shown in SEQ ID NO:
13. 

MTRALCSALRQALliLLAAAAELSPGLKCVO^LCDSSNFTCQTEGACWASVMLTNQKEQVI 
KSCVSLPELNAQVFCHSSNNVTKTECCFTDFCNNITLHLPTASPNAPKLGPMELAI I ITV 
PVCLLSIAAMLTVWACQGRQCSYRKKKRPNVEEPLSECNLVNAGKTLKDLIYDVTASGSG 

SGLPLLVQRTIARTIVLQEIVGKGRFGEVWHGRWCGEDVAVKIFSSRDERYWFREAEIYQ
TVMLRHENILGFIAADNKDNGTWTQLWLVSEYHEQGSLYDYLNRNIVTMAGMIKLALSIA 
SGIJ^LHMEIVGTQGKPAIAHRDIKSKNILVKKCETCAIADLGLAVKHDSILNTIDIPQN 
PKVGTKRYMAPEMLDDTMNVNIFESFKRADIYSVGLVYWEIARRCSVGGIVEEYQLPYYD 
MVPSDPS I EEMRKWCDQKFRPS IPNQWQSCEALRVMGRIMRECWYANGAARLTALRI KK 

TISQLCVKEDCKA

SEQ ID NO: 15 

gi|38074652|ref|XM_194020.3| Mus musculus activin A receptor, type IC (Acvr1c), mRNA

1 gccccgggaa cttcaaagcg gcccgcgctg cgggctgcgc tctgggaccc cgaagccttg 

61 caccgccgcc gggtggccat gaccccagcg cgcggctccg cactgagcct ggccctcctg 

121 ctggtggccc tggccgccga ccttgcggca ggactgaagt gtgtgtgtct tttgtgtgat

181 tcttcaaact tcacctgcca aacggaagga gcatgttggg cctccgtcat gctaaccaac 

241 gggaaagagc aggtgatcaa atcgtgtgtg tccctcccgg aactaaatgc tcaggtcttc 

301 tgtcacagtt ccaacaacgt gaccaaaacc gaatgttgct tcacagactt ccgcaacaat 

361 atcacactgc accttcccac agcatcgcca aatgcccctc gacttggtcc cacagagctg 

421 acggttgtta tcaccgtgcc ggtttgcctc ctgtccatag ccgccatgct aacaatatgg

481 gcgtgccagg accgccagtg cacatacagg aagactaaga gacacaacgt ggaggaagcg 

541 ctggctgagt acagccttgt gaatgcagga aaaacactca aggatctgat ctacgatgct 

601 actgcctctg gctcaggctc tggtctgcct ctcttggttc aaagaacaat cgcaaggaca 
661 attgtacttc aagaaatcgt aggaaaaggt cggtttgggg aagtgtggca cggaagatgg
721 tgtggagaag atgtggctgt gaaaatattc tcctccaggg atgagagatc ttggttccgt 
781 gaggcggaaa tttatcagac ggtgatgctg agacacgaga acatcctcgg tttcatcgca 
841 gctgacaaca aagataatgg aacttggaca caactctggc ttgtgtcaga gtatcacgag 
901 cagggctcct tgtatgacta tttgaacaga aacatagtga ctgtggctgg aatggtcaag
961 ctggcgcttt ccatagcgag tggtctggct cacctgcaca tggagatcgt gggtactcaa 
1021 ggtaagcctg ctattgctca ccgagatata aagtcaaaga atatcctagt caaaaaatgt 
1081 gacacttgtg ccatagctga cttagggctg gctgtgaagc acgattctat catgaacact 
1141 atagacatac cccagaatcc taaagtggga accaagaggt acatggctcc cgaaatgctt 
1201 gatgatacaa tgaacctgag catctttgag tccttcaagc gagctgacat ctattcggtg
1261 gggctggttt actgggaaat agctcgaagg tgttcagttg gaggagttgt tgaggagtac 
1321 cagttgcctt actatgacat ggtgccttca gatccttcga tagaagaaat gaggaaggtt 
1381 gtctgtgatc agaaactccg accaaatctc ccaaaccagt ggcaaagctg tgaagcgctg
1441 cgggtcatgg gaagaataat gcgtgagtgc tggtacgcca acggggcagc acgcctgaca 
1501 gccctgcgcg tgaagaagac catctcccag ctgtgtgtca aggaagactg taaagcttag
1561 ctctgcgtac aggcaaagga agagtgctca cagctttcct tcccgtctcc cgctttgtgt 
1621 aaatgttttt gcttcttttc tgctgtgttt tattttagtt ctacctcaag gatgactcac 
1681 tacagtgttg aagtgtccca aaggcagcat gtaaagataa cgctaaagtt aagcatgggc 
1741 aggtcttgac tttcccagtt tccatgttgt ccttagtttt atgtttaaag gggacacgac 
1801 tcgttttttc tttttattta aggaggaaga tgttatgagt ataagataag ctacatacaa
1861 tggtaaaaaa aatattaaat ttttgttgta cttgaaccaa gagcatccac aaatgagatg 
1921 aagacatatt taaaatgtga actactgtat attatggtga aatggaactg taaaaattaa 
1981 ctgggtcctt aaaaagtaaa ttttgctcag aattttaaca tctaaagagt ggaaaggtgg 
2041 agtaacaaag tgacaccacg ttctcaggac ttgctttctc tgcctatttg ccactctgca 
2101 tcactacgac agctgtccct aagtagttcg aaaaccacct gagccattac tcctcctgat
2161 acataacaga ggtccatatg tctcaaacac tgtgaaatgg cccagagtca cagagctctt 
2221 gccagagctg gttacgaacc ctgtgacaac tagcactgtg aagtttgctg gccattatac 
2281 tcccttgaat tcccatagag cgtttcttcc caagtgggaa gatcaatcat cagttataat 
2341 attttaaatt tagtcacata gtgaatgcat tgagtatttt gtaagaccat aatagatttt 
2401 ctagaatttc agaaccaatc tattagaatt tcatcaaaga aatgtcatca ttatcccatt
2461 cgataatgaa aaaaaatgtt ttctgaacat tgccaaatat tgaccaatta taaacatcca 
2521 ttttctaaag atgtatgtca taagatgtca ttgcttttga ggttcatggt tggaaagaaa 
2581 atgacagaaa gataaacacg gaaagaattt gaggagagtt cagagaactc tcccgaagga 
2641 gtgtaacccc ttactggtaa cagaagatca catcagtgat tacaaggcat tcctggcctt 
2701 ttaacaggga ccatgcatgg aagggtttgg aggatctacc tctattctcc atttttgaag
2761 ttcagaatat ttaacaggac accagatatt tctttcaaat gtcctatagg ctttagtgaa 
2821 gatggtttaa ctcaacgtct ttcagtgttc tgaaacagtt tcccgaaccc cattccttag 
2881 aactttcatc tacctagaag ttgttagaat ggatgtgcac tctaacaagg tcttgaaaag 
2941 tgatatgtta acatattcta ttctttgata gttctagcag caaagacaca ttccatggat 
3001 atttttaagg gtaaatattt gattgtagcc tttctattat gagtaaggtg aagttttact
3061 atctagcatc ttctaaagcc caacatggta aatgatgttg tgaagaatac gttatgtatc 
3121 ccataagagg gtaacttgat gtctacatat cactatggag atatgctact cgatccatct 
3181 catttgacta agtggagata agagaaacaa aagctgacag ctgtggagtt tgcatgtact 
3241 tccattctca gtcatgtcct catccattta taagcacagg gcaccccata acttacttgc 
3301 cacaatattg ggtacaagag ggaataacaa gatagatagg tcctgccctc ggggatttta
3361 aagtctaatt tgggagtggg gaaagacatg tgagtatgag ggaagcgagt aaattgacag 
3421 ataaaatatg cagtgggatg cctttgagtt ctatgtgaca atggcttcta gaataggtct 
3481 taaaatcact tccaatttgc aacacattta gaagggatct ttatttagat attacagcct 
3541 aattatatca tcagcatcaa aaggtgcgcc tacgtgaatg gatgttaaga agaaccgtgc 
3601 gaagggagag ctggcttatg tctattatct tatttcacag tttctacatg acggcaatgg
3661 aattatttac ttcaatgatt gtatccagat atccaatgaa aaaaacctac tctatcctgt 
3721 tccattcacc actgttaggg aagtatattt tctagatttt aaaagagggt gttgtaacat 
3781 ctttattttg gtcattatgt tttataattt atttcagggt atatacaaac ataacttata
3841 ttttcttggt ttagatttca cagtggagaa atctgcacat tggtacattt tgagactttt
3901 gctcattatt ctgttgctga actgtaatcc ctaagtgaat aatcttaaag agcagagcca
3961 cagaagtagt cagggaacaa tagaagttgt agtttttgct taactctcaa catagttcta
4021 catagatact gctcacttat aaaaaataga gtaattttta gagcataagg catatttact 
4081 gtatatatct atgcagttca attttgtatt atgtgttttt gatggaatct tatgtaacac 
4141 ttggtccttg taaattttgt tacgagcaaa aatatcttct taggttaggt gcaaagtgac 
4201 accattgctt tgctccttat ggatattttg gctctgggta taacatagag cttaggaatg
4261 ctattatgaa tttttagtac tgtatgcaaa aaatggtggg gtttatatac agcatcactt 
4321 cttgaaatac agaacgcatt atttgttgtc tacatttgag catgaatttg ttgccttata
4381 agtcctgcgt tttgagtttg taatcagtac tgactctgtt gtatgcatcc ttccacgtct 
4441 aaaatatttg gcatgtcaca tctagaattc ttaatttatg ttctgccatg agagttaagt 
4501 gaaacatgac tgtcatgctc tattttaagc gcagcacttg cttttcatct ttatactttt 
4561 caattaactt tgcattttaa atttccataa ttgtatgaaa atagtaacct gatcgcaatg 
4621 tctgaagagt ttcaaatcgg cgtttatttt aaaatg 

SEQ ID NO: 16

Amino acid sequence of mouse ALK7 encoded by the DNA sequence shown in SEQ ID NO: 15.

mtparosalsIiALllvai^aadlaaqlkcvcllcdssnftcqtegacwasvmltngkeqvi 
kscvslpelnaqvfchssnnvtkteccftdfqwitlhlptaspnaprlgpteltwitv 
pvcllsiaamltiwacqdrqctyiuctkrhnvee^ 

sglpllvqrtiartivlqeivgkgrpgevwhgrwcgedvavkifssrderswfreaeiyq 

TVMLRHENILGFIAADNKDNGTWTQLWLVSEYHEQGSLTO 

SGLAHLHME I VGTQGKP AI AHRD I KSKNILVKKCDTCAIADLGLAVKHDS IMNTIDI PQN 
PKVGTKRYMAPEMIiDDTMNLSIFESFKRADIYSVGLVYWEIARRCSVGGVVEEYQLPYYD 
MVPSDPSIEEMRKWCDQKIAPNLPNQWQSCEALRVMGRIMRECWYANGAARLTALRVKK 
TISQLCVKEDCKA 

SEQ ID NO: 17

gi|20806128|ref|NM_139090.1| Rattus norvegicus activin receptor-like kinase 7 (Alk7),
mRNA 

1 gggaggcccc gctgccacta gagccaaccg cgcacttccg aaggtgtcgc ggctggcctc 
61 ccctcccgcc gccccgggaa cttcaaagcg gcccgcgctg cgtgccgctc tgggaccccg 
121 aagccttgca ccgccgcggg gtggccatga ccccagcaag ccgctccgca ctgagcctgg 
181 ccctcctgct ggtggcactg gcctccgacc ttgcggcagg actgaagtgt gtgtgtcttt 
241 tgtgtgattc ctcaaacttt acctgccaaa ccgaaggagc atgctgggcc tctgtcatgc 
301 taaccaacgg gaaagaacag gtgatcaaat cgtgcgtgtc cctcccggaa ctaaatgctc 
361 aggtcttctg tcacagttcc aacaacgtga ccaagaccga atgttgcttc acagacttct 
421 gcaacaacat cactctgcac cttcccacag catctccaga tgcccctaga cttggcccca 
481 cagagctgac agttgttatc actgtacctg tttgcctcct gtccatcgca gccatgctaa 
541 cgatatgggc ctgccaggac cgccagtgca catacaggaa gaccaagaga cacaatgtgg 
601 aggaaccact ggcagagtac agccttgtca atgctggaaa aaccctcaaa gatctgattt 
661 atgatgccac tgcctcgggc tcaggatctg gcctgcctct tttggttcaa agaaccatcg 
721 caaggacaat tgtacttcaa gaaatcgtag gaaaaggtcg gtttggggaa gtgtggcacg 
781 gaagatggtg tggagaagat gtggctgtga aaatattctc ctccagagat gagagatctt 
841 ggttccgtga ggcagaaatt tatcagacgg taatgctgag acatgagaat attctcggtt 
901 tcatcgcggc cgacaacaaa gataatggaa cctggactca gctttggctt gtgtcagagt 
961 atcacgagca gggctcctta tatgactatt tgaatagaaa catagtgacc gtggctggaa 
1021 tggtcaagtt ggcgctttca atagcgagtg gtctggctca cctacacatg gagatcgtgg 
1081 gcactcaagg taagcctgct attgctcacc gagatataaa gtcaaagaat atcttagtca
1141 aaaagtgtga cacttgtgcc atagctgact tagggctggc tgtgaaacat gattctatca 
1201 tgaacactat agatataccc cagaatccta aagtgggaac caagaggtat atggctcccg 
1261 aaatgcttga tgatacaatg aacgtcaaca tctttgagtc cttcaagcga gctgacatct 
1321 attcggtggg gctggtttac tgggaaatag ctcgaaggtg ttcagttgga ggacttgttg 
1381 aagagcacca gttgccttat tatgacatgg tgccttcaga tccttccata gaggaaatga 
1441 ggaaggtcgt ttgtgatcag aaactccgac caaatctccc aaaccagtgg caaagctgtg 
1501 aggcgctocg ggtcatggga agaataatgc gtgagtgctg gtatgccaac ggggcagctc 
1561 gcctgaccgc cctgcgcgtg aagaagacca tttctcagct gtgtgtcaag gaagactgta 
1621 aggcctaagg cgcatacagg cgacgggaaa gccctcacca ctctctttca tgtctcctgc 
1681 tttgtgtaaa tgttttcgtt tcttttctgc tgtgttttgt tttagttcta cctcaaagat 
1741 gattcactac agtgttgaag tgtccaaagg cagcatgaaa agataactct aagcatgggc 
1801 aggtcttgac tttcccagtt tccatgttgt ccgtactttt atttttaaag gtgacactat 
1861 tcatttttct ttttatttaa ggaggaagat gttat

SEQ ID NO: 18

Amino acid sequence of rat ALK7 encoded by the DNA sequence shown in SEQ ID NO: 17. 

MTPASRSALSLALLLVAIASDLAAGLKCVCL^^ 
KSCVSLPBLNAQVFCHSSNNVTKTECCFTDFCNNITLHLPTASPDAPRLGPTELTVVITV
PVCLI^IAAMLTIWACQDRQCTYRKTKRHNVEEPLAEYSLVNAQKTLKDLIYDATASGSG 
SGLPLLVQRT I ART I VLQE I VGKGRFGEVWHGRWCGEDVAVKI F S SRDER S WFREAE I YQ 
TVMLRHENILGFIAADNKDNGTWTQLWLVSEYHEQGSLYDYLNRNIVTVAGMVKIJUjSIA 
SGLAHl^n^EIVGTQGKPAIAHRDIKSKKILVKKCDTCAIADLGLAVKHDS 
PKVGTKRYMAPEMLDDTMNVNI FESFKRADI YSVGLVYWE I ARRCS VGGLVEEYQLP Y YD
MVPSDPS1EEMRKWCTQKLRPNLPNQWQSCEALRVMGRIMRECWYANGAARLTALRVKK 
TISQLCVKEDCKA 

SEQ ID NO: 19 

gi|21314629|ref|NM_004054.2| Homo sapiens complement component 3a receptor 1
(C3AR1), mRNA

1 actgtggcta agtgtgggga ccagacagga ctcgtggaga catccaggtg ctgaagcctt 
61 cagctactgt ctcagttttt tgaagtttag caatggcgtc tttctctgct gagaccaatt 
121 caactgacct actctcacag ccatggaatg agcccccagt aattctctcc atggtcattc 
181 tcagccttac ttttttactg ggattgccag gcaatgggct ggtgctgtgg gtggctggcc 
241 tgaagatgca gcggacagtg aacacaattt ggttcctcca cctcaccttg gcggacctcc
301 tctgctgcct ctccttgccc ttctcgctgg ctcacttggc tctccaggga cagtggccct 
361 acggcaggtt cctatgcaag ctcatcccct ccatcattgt cctcaacatg tttgccagtg 
421 tcttcctgct tactgccatt agcctggatc gctgtcttgt ggtattcaag ccaatctggt 
481 gtcagaatca tcgcaatgta gggatggcct gctctatctg tggatgtatc tgggtggtgg 
541 cttttgtgat gtgcattcct gtgttcgtgt accgggaaat cttcactaca gacaaccata
601 atagatgtgg ctacaaattt ggtctctcca gctcattaga ttatccagac ttttatggag 
661 atccactaga aaacaggtct cttgaaaaca ttgttcagcc gcctggagaa atgaatgata 
721 ggttagatcc ttcctctttc caaacaaatg atcatccttg gacagtcccc actgtcttcc
781 aacctcaaac atttcaaaga ccttctgcag attcactccc taggggttct gctaggttaa 
841 caagtcaaaa tctgtattct aatgtattta aacctgctga tgtggtctca cctaaaatcc
901 ccagtgggtt tcctattgaa gatcacgaaa ccagcccact ggataactct gatgcttttc 
961 tctctactca tttaaagctg ttccctagcg cttctagcaa ttccttctac gagtctgagc 
1021 taccacaagg tttccaggat tattacaatt taggccaatt cacagatgac gatcaagtgc 
1081 caacacccct cgtggcaata acgatcacta ggctagtggt gggtttcctg ctgccctctg 
1141 ttatcatgat agcctgttac agcttcattg tcttccgaat gcaaaggggc cgcttcgcca
1201 agtctcagag caaaaccttt cgagtggccg tggtggtggt ggctgtcttt cttgtctgct 
1261 ggactccata ccacattttt ggagtcctgt cattgcttac tgacccagaa actcccttgg 
1321 ggaaaactct gatgtcctgg gatcatgtat gcattgctct agcatctgcc aatagttgct 
1381 ttaatccctt cctttatgcc ctcttgggga aagattttag gaagaaagca aggcagtcca 
1441 ttcagggaat tctggaggca gccttcagtg aggagctcac acgttccacc cactgtccct
1501 caaacaatgt catttcagaa agaaatagta caactgtgtg aaaatgtgga gcagccaaca
1561 agcaggggct cttaggcaat cacatagtga aagtttataa gaggatgaag tgatatggtg 
1621 agcagcggac ttcaaaaact gtcaaagaat caatccagcg gttctcaaac ggtacacaga 
1681 ctattgacat cagcatcacc tagaaacttg ttagaaatgc aaattctcaa gccgcatccc 
1741 agacttgctg aatcggaatc tctgggggtt gggacccagc aagggcactt aacaaaccct
1801 cgtttctgat taatgctaaa tgtaagaatc attgtaaaca ttagttctat ttctatccca 
1861 aactaagcta tgtgaaataa gagaagctac tttgttttta aatgatgttg aatatttgtc
1921 gatatttcca tcattaaatt tttccttagc attgtaaaaa aaaaaaaaaa aaaaaaaaaa 
1981 aaaaa 

SEQ ID NO: 20

Amino acid sequence of human C3AR1 encoded by the DNA sequence shown in SEQ ID
NO: 19. 

MASFSAETNSTDLLSQPraEPPVILSMVILSLTFLLGLPGNGLVLWVAGLKMQRTVNTIW 
PLHLTLADLLCCLSI#PFSLAHLALC^QWPYGRFLCKLIPSIIVLNMFASVFLLTAISLDR 
CLWFKPIWCQNHRMVGMACS I CGCI WWAFVMCI PVFVYRBI FTTDNHNRCGYKFGLSS
SLDyPDFYGDPLENRSLENIVQPPGEMKDRLDPSSFQTNDHPWTVPTVFQPQTFQRPSAD 
SliPRGSARLTSQNLYSNVFKPADWSPKIPSGFPIEDHETSPLDNSDAFLSTHLKLFPSA 
SSNSFYESELPQGFQDYYNLGQFTDDDQVPTPLVAITITRLWGFLLPSVIMIACYSFIV 
FRMQRGRFAKSQS KTFRVAWWAVFLVCWTP YHI FGVLSLLTDPETPLGKTLMS WDHVC
IALASANSCFNPFLYALLGKDFRKKARQSIQGILEAAFSEELTRSTHCPSNNVISERNST
TV 

SEQ ID NO: 21

gi|6753223|ref|NM_009779.1| Mus musculus complement component 3a receptor 1 (C3ar1),
mRNA 

1 gaattccatc tcagtgtgct tgactgagcc atggagtctt tcgatgctga caccaattca

61 actgacctac actcacggcc tctgtttcaa ccccaagaca ttgcctccat ggtcattctt 
121 ggtctcactt gtctattggg actgctaggc aatgggctgg tgctgtgggt agctggcgta 
181 aagatgaaga cgaccgtgaa cacagtctgg ttcctccatc tcaccctggc cgatttcctc 
241 tgctgcctct ccttgccctt ctccttggct cacctgattc tccaaggaca ctggccctat 
301 ggcttgttcc tgtgcaaact tatcccatcc atcattattc tcaacatgtt tgccagtgtc
361 ttcctgctta ctgccattag cctggaccga tgtctgatag tacataagcc aatctggtgc 
421 cagaatcatc gaaacgtgag aaccgccttc gccatctgtg gatgtgtctg ggtggtagcc 
481 tttgtgatgt gtgtgcccgt atttgtatac cgtgatctgt tcattatgga caatcgcagt 
541 atatgtagat ataattttga ttcctccagg tcatatgatt attgggacta cgtgtacaaa 
601 ctaagtctac cagaaagcaa ttctactgat aactccactg ctcagctaac tggacatatg
661 aatgacaggt cagctccttc ctctgtacag gcaagggatt acttttggac agttaccact 
721 gccctccagt cacagccatt cctaacatct cctgaagact cattctctct agattcagca 
781 aaccaacaac cccattatgg tggaaagcct cctaatgtcc tcacagccgc cgtacccagc 
841 gggtttcctg ttgaagatcg taaatccaat acactgaacg ctgacgcttt tctctctgct 
901 cacacagaac ttttccctac tgcttctagt ggtcatttat acccctatga tttccagggg
961 gattatgttg accaattcac gtatgacaat catgtgccga caccgctgat ggcaataacc 
1021 atcacaaggc tggtggtggg cttcctggtg ccgtttttca tcatggtaat ttgttacagc 
1081 ctcatcgtct tcagaatgcg aaaaaccaac ttcaccaagt ctcggaacaa aacctttcgg 
1141 gtggctgtgg ctgtggtcac tgtctttttt atctgctgga ctccatacca tcttgtcgga
1201 gtcctgctat tgattactga tccagaaagt tccttggggg aagctgtgat gtcctgggac
1261 cacatgtcca ttgctttagc atctgccaat agttgcttca accctttcct gtatgccctc 
1321 ttggggaaag actttaggaa gaaagcaaga cagtctataa agggcattct ggaagcagcc 
1381 ttcagcgaag agctcacgca ctctaccaac tgtacccaag acaaagcctc ttcaaaaaga 
1441 aacaatatga gtacagatgt gtgaagatgt ggccctggga acctaagcag agttctcagg 
1501 tgaacagtga tggatgacat gtgagcagga cactttagac aatttggcga ctctcagaga
1561 aaggtctctt attgacatca gcatcatttg aaaacattaa agatgcaaaa tttcaagccc 
1621 catcccagat gtgttgactc agaatctctg gcccatggga ccagtgtttt aacaggcctt 
1681 cttgtttcca tcagtgttaa gttttacctc atttggctta gtctattccc atccctgact 
1741 acaccatgtg caatgaataa ctttttcatc tgttttcagt attctttttt tttccttagc 
1801 atcatctaaa cttctagttt gcatggaagg ctgctcttat tgttctgaat ggaagatatt
1861 catttattgt acagttttgt ggtggtgaca agtgattttt aagtggggaa agagacacag 
1921 taagaaaaga tctatgaaag cagggagtgt tgagttagag tttgacagaa cacagtgcca 
1981 aatgccaccc actaaaagca acctgagata attccagtgt tcatgtgagc aagtgagcac 
2041 agatacacat aaacactttc ctactcctgg agtgttttag aagttgtagc ttggagctct 
2101 ctcttccctg tgtattctgc ttgttcatag agccatctat gttccatttc tttttgtttg
2161 ttggtttggc ttcttgtgtt ggttggttgg ttggttggtt ggttggttgg ttggttggtt 
2221 gcttggttgg ttggttggtt ggttggttgg ttgcttggtt gcttggttta ctgttttttc 
2281 aagacagtag ccctggctgt cctgaaactc attctgtaga ccaaactggc ctgaaactca 
2341 gagatcagtg tgcctctgcc tccaaggagt cttcttaagg tcttgaattc ctttctgttt 
2401 tccacacagg gccttgccat ctcctgctgg gattaggcgt gtgcaggtgt gcagtacgta
2461 cctggcccca tcgtcttcgg ctttttacat tagactttgt atctcacata gaataaaaca
2521 atatcctaat gggatgatgg gtggaacttc aggatcaatg catgcttcaa cagcaatatc 
2581 aaaagatttt aaacatgagt ctaggtatct tacatgacaa aagtgtcagg cacactctct 
2641 tttttttctt ttcttttctt ttcttttttt ttttttcaaa ttaatgtttg cctggaccat 
2701 gctaatataa agtactgtat gcagccaatt ttaaaggaac taattaaaaa tgaaataaaa
2761 aagggtggtt tctccagtgc ccagtagttc ccatctctcc ctactttgca agaattgtag 
2821 ctgttccttc tgatatttac catggtgtgc ctaaaacacc tcttatgctg ctgatctttg 
2881 tagacaggaa gaaaaggctg tcttgatttt gatttacttt acctacattt tcatccagca 
2941 ccccagtgtt gactggtatc acacattaat ctgttcgtga tcatttttct cttccctttg 

3001 ttgtcgggtt ttgcttgcct tcgtgccatc atctcagtgt agcacacaca atctagcttc
3061 agactcccag gtctcttgtt atagtaacca gatgagcacc accaccatcc ccacccccag 
3121 ctgaccacac aattttcctg tcaatacatt tcaatagcat acagcttttc tctgtcctct 
3181 catttttcat gctgacatat attaattgta catattaata aggttccacg tggctgatcc 
3241 atccatgcac acaacattca cactttcacc cccagcctct tctttatcat tattctactt 

3301 actgattgac tttaaaaaaa aaaaaaaaa

SEQ ID NO: 22

Amino acid sequence of mouse C3AR1 encoded by the DNA sequence shown in SEQ ID 
NO: 21. 

MESTOMTNSTDLHSRPLFQPQDIASMVILGL^ 

FLHLTLADFLCCLSLPFSLAHLILQGHWPYGLF^

CLIVHKPIWCQNHRNVRTAFAICGCVWWAFVMCVPVFVYRDLFIMDNRSICRYNFDSSR 
SYDyWDYVyKLSLPESNSTDNSTAQLTQHMNDRSAPSSVQARDYFWTVTTALQSQPFLTS 
PEDSFSLDSANQQPHYGGKPPNVLTAAVPSGFPVEDRKSKTLNADAFLSAHTELFPTASS 
GHLYPYDFQGDYVDQFTYDNHVPTPLMAITITRLWGFLVPFFIMVICYSLIVFRMRKTN 

FTKSRNKTFRVAVAWTVFFICWTPYHLVGVLLLITDPESSLGEAVMSWDHMSIALASAN
SCFNPFLYALLGKDFRKKARQSIKGILEAAFSEELTHSTNCTQDKASSKWINMSTDV 

SEQ ID NO: 23 

gi|14091739|ref|NM_032060.1| Rattus norvegicus complement component 3a receptor 1 
(C3ar1), mRNA

1 tgtcgagtaa gcactcagca actagtccaa catcctccac ggcggctgga ccagatagga

61 tttgttggtg gctcacagat catcagtcct ggagccttct gatcccacct cagtgctctt 
121 gactgagcaa tggagtcttt cactgctgac accaactcaa ctgacctaca ctcacggcct 
181 ctgtttaaac cccaagacat tgcctccatg gtcattctca gtctcacgtg tttattggga 
241 ctgccaggca atgggctggt gctgtgggtc gctggcgtaa agatgaagag gaccgtgaac 
301 acagtctggt tcctccacct caccctggcc gatttcctct gctgcctctc cttgcccttc
361 tccgtggctc acctgattct ccgaggacac tggccctatg gcttgttcct gtgcaaactt 
421 atcccgtccg tcattatcct caacatgttt gccagtgtct tcctgcttac tgccattagc 
481 ctggaccgct gtctgatggt gcacaagccg atctggtgcc agaaccatcg gagcgtgaga 
541 acagccttcg ccgtctgtgg atgcgtctgg gtggtaacct ttgtgatgtg tataccggta 
601 tttgtatacc gtgatctgct cgttgtggat gattacagtg tatgtggcta taactttgat
661 tcctccaggg catatgatta ttgggactac atgtacaatt cacatctacc agaaatcaac 
721 cctcctgaca actccactgg acatgtggat gacaggacag ctccttcctc ctctgtacca 
781 gcaagggacc tttggacagc taccactgct ctccagtcac aaacgttcca cacatctcct 
841 gaagacccat tctctcaaga ctcagcaagt cagcaacccc attatggggg gaagcctcct
901 actgtgccca tagccaccat acccggcggg tttcctgttg aagatcataa atccaacaca
961 ctgaacactg gtgcttttct ctctgcccac acagagccgt ccctcactgc ttctagcagt 
1021 cctttatacg cccatgattt cccggatgat tattttgacc aactgatgta tggcaatcac 
1081 gcgtggacac ctcaggtggc aatcaccatc tcgaggctgg tggtgggctt cctggtgccc 
1141 tttttcatca tgataacctg ttacagcctc atcgtcttca gaatgcgaaa aaccaacctc 
1201 accaagtcga ggaacaaaac ccttcgggtg gccgtggcgg tggtcaccgt cttttttgtc
1261 tgctggattc cataccatat tgttggaatc ctgttagtga ttactgacca agaaagcgcc 
1321 ttgagagaag ttgtgctacc ctgggaccac atgtccattg ctttagcatc tgccaatagt 
1381 tgtttcaacc ctttccttta tgccctcttg ggaaaagact tcaggaagaa agcaagacag

1441 tccgtaaagg gcattctgga agcagccttc agcgaagagc tcacgcactc taccagttgt 

1501 acccaggaca aagccccctc aaaaagaaac catatgagta cggatgtgtg aagatgctgc 

1561 actggggacc taagcagatg ttctgaggtg aatactggtg atgggtgacc tgtgagcggg 

1621 acaccttaga cagcctggcc accctcagag aaaggtctct tattgacatc agcatcattt

1681 gaaaacacta aaggcacaaa atttcaagcc ccatcccaga tgtttgactc cgaatctctg 

1741 gcccgtgggg accaatgtct taacaggccc ccttgtttcc accaatgtta agttttattc 

1801 aactcatttt attcccatcc ctgaatcggc catgtgcaat gaataacgtc ttcatctgtt 

1861 tocagtatta atctcttctt gcatagcatc atttaaatcc tttagtttgt gcgggaggct 

1921 gctattgtcc agagtgaaag atattttttt attgaacatt tgtgtggtgg tagcagtgat

1981 tttttagtgg ggaggcaggg gagaaagaca cagaaataaa aggtttttgg aaaaaaaaaa 

2041 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa gaaaaaaaaa aaaaaaaaaa aaa 



SEQ ID NO: 24

Amino acid sequence of rat C3AR1 encoded by the DNA sequence shown in SEQ ID NO: 23.

MESFTADTNSTDIiHSRPLFKPQDIAShWILSLTCLLG 

FLHLTLADFLCCLSLPPSVAHLILRGHWPYGLFLCKLIPSVIILNMFASVPLLTAISLDR 
CLMVHKP I WCQNHRS VRTAFAVCGCVWW^ 

AYDYWDYMYNSHLPEINPPDNSTGHVDDRTAPSSSVPARDLWTATTALQSQTFHTSPEDP 
FSQDSASQQPHyGGKPPTVLIATIPGGFPVEDHKSNTLNTGAPLSAHTEPSLTASSSPLY
AHD FPDD YFDQLMYGNHAWTPQVAI TI SRL WGFLVPF F I MI TC YSLI VFRMRKTNLTKS 
RNKTLRVAVAWTVFF VCW 1 P YH I VGI LLVITDQES ALRE WLPWDHMS I ALASANSCFN 
PFLYALLGKDFRKKARQSVKGILEAAFSEELTHSTSCTQDKAPSKRNHMSTDV 

SEQ ID NO: 25 



gi|34878691|ref|NM_005795.2| Homo sapiens calcitonin receptor-like (CALCRL), mRNA



1 gaacaacctc tctctctcca gcagagagtg tcacctcctg ctttaggacc atcaagctct 
61 gctaactgaa tctcatccta attgcaggat cacattgcaa agctttcact ctttcccacc 
121 ttgcttgtgg gtaaatctct tctgcggaat ctcagaaagt aaagttccat cctgagaata 
181 tttcacaaag aatttcctta agagctggac tgggtcttga cccctgaatt taagaaattc 
241 ttaaagacaa tgtcaaatat gatccaagag aaaatgtgat ttgagtctgg agacaattgt
301 gcatatcgtc taataataaa aacccatact agcctataga aaacaatatt tgaaagattg 
361 ctaccactaa aaagaaaact actacaactt gacaagactg ctgcaaactt caatttgtca 
421 accacaactt gacaaggttg ctataaaaca agattgctac aacttctagt ttatgttata 
481 cagcatattt cattttggct taatgatgga gaaaaagtgt accctgtatt ttctggttct 
541 cttgcctttt tttatgattc ttgttacagc agaattagaa gagagtcctg aggactcaat 
601 tcagttggga gttactagaa ataaaatcat gacagctcaa tatgaatgtt accaaaagat 
661 tatgcaagac cccattcaac aagcagaagg cgtttactgc aacagaacct gggatggatg 
721 gctctgctgg aacgatgttg cagcaggaac tgaatcaatg cagctctgcc ctgattactt 
781 tcaggacttt gatccatcag aaaaagttac aaagatctgt gaccaagatg gaaactggtt 
841 tagacatcca gcaagcaaca gaacatggac aaattatacc cagtgtaatg ttaacaccca 
901 cgagaaagtg aagactgcac taaatttgtt ttacctgacc ataattggac acggattgtc 
961 tattgcatca ctgcttatct cgcttggcat attcttttat ttcaagagcc taagttgcca 
1021 aaggattacc ttacacaaaa atctgttctt ctcatttgtt tgtaactctg ttgtaacaat 
1081 cattcacctc actgcagtgg ccaacaacca ggccttagta gccacaaatc ctgttagttg 
1141 caaagtgtcc cagttcattc atctttacct gatgggctgt aattactttt ggatgctctg 
1201 tgaaggcatt tacctacaca cactcattgt ggtggccgtg tttgcagaga agcaacattt 
1261 aatgtggtat tattttcttg gctggggatt tccactgatt cctgcttgta tacatgccat 
1321 tgctagaagc ttatattaca atgacaattg ctggatcagt tctgataccc atctcctcta 
1381 cattatccat ggcccaattt gtgctgcttt actggtgaat ctttttttct tgttaaatat 
1441 tgtacgcgtt ctcatcacca agttaaaagt tacacaccaa gcggaatcca atctgtacat 
1501 gaaagctgtg agagctactc ttatcttggt gccattgctt ggcattgaat ttgtgctgat 
1561 tccatggcga cctgaaggaa agattgcaga ggaggtatat gactacatca tgcacatcct 
1621 tatgcacttc cagggtcttt tggtctctac cattttctgc ttctttaatg gagaggttca 
1681 agcaattctg agaagaaact ggaatcaata caaaatccaa tttggaaaca gcttttccaa 
1741 ctcagaagct cttcgtagtg cgtcttacac agtgtcaaca atcagtgatg gtccaggtta 
1801 tagtcatgac tgtcctagtg aacacttaaa tggaaaaagc atccatgata ttgaaaatgt 
1861 tctcttaaaa ccagaaaatt tatataattg aaaatagaag gatggttgtc tcactgtttt 
1921 gtgcttctcc taactcaagg acttggaccc atgactctgt agccagaaga cttcaatatt 
1981 aaatgacttt ttgaatgtca taaagaagag ccttcacatg aaattagtag tgtgttgata 
2041 agagtgtaac atccagctct atgtgggaaa aaagaaatcc tggtttgtaa tgtttgtcag 
2101 taaatactcc cactatgcct gatgtgacgc tactaacctg acatcaccaa gtgtggaatt 
2161 ggagaaaagc acaatcaact tttctgagct ggtgtaagcc agttccagca caccattgca 

2221 tgaattcaca aacaaatggc tgtaaaacta aacatacatg ttgggcatga ttctaccctt 
2281 attgccccaa gagacctagc taaggtctat aaacatgaag ggaaaattag cttttagttt 
2341 taaaactctt tatcccatct tgattggggc agttgacttt ttttttgccc agagtgccgt 
2401 agtccttttt gtaactaccc tctcaaatgg acaataccag aagtgaatta tccctgctgg 
2461 ctttcttttc tctatgaaaa gcaactgagt acaattgtta tgatctactc atttgctgac 

2521 acatcagtta tatcttgtgg catatccatt gtggaaactg gatgaacagg atgtataata 
2581 tgcaatccta cttctatatc attaggaaaa catcttagtt gatgctacaa aacaccttgt 
2641 caacctcttc ctgtcttacc aaacagtggg agggaattcc tagctgtaaa tataaatttt 
2701 gtcccttcca tttctactgt ataaacaaat tagcaatcat tttatataaa gaaaatcaat 
2761 gaaggatttc ttattttctt ggaattttgt aaaaagaaat tgtgaaaaat gagcttgtaa 
2821 atactccatt attttatttt atagtctcaa atcaaataca tacaacctat gtaattttta 
2881 aagcaaatat ataatgcaac aatgtgtgta tgttaatatc tgatactgta tctgggctga 
2941 ttttttaaat aaaatagagt ctggaatgct aaaaaaaaaa aaaa 

SEQ ID NO: 26 

Amino acid sequence of human CALCRL encoded by the DNA sequence shown in SEQ ID NO: 25. 

MEKKCTLYFLVLLPFFMILVTAELEESPEDSIQLGVTRNKIMTAQYECYQKIMQDPIQQA 
EGVYCNRTWDGWLCWNDVAAGTESMQLCPDYFQDFDPSEKVTKICDQDGNWFRHPASNRT 
WTNYTQCNVNTHEKVKTALNLFYLTIIGHGLSIASLLISLGIFFYFKSLSCQRITLHKNL 
FFSFVCNSVVTIIHLTAVANNQALVATNPVSCKVSQFIHLYLMGCNYFWMLCEGIYLHTL 
IVVAVFAEKQHLMWYYFLGWGFPLIPACIHAIARSLYYNDNCWISSDTHLLYIIHGPICA 
ALLVNLFFLLNIVRVLITKLKVTHQAESNLYMKAVRATLILVPLLGIEFVLIPWRPEGKI 

AEEVYDYIMHILMHFQGLLVSTIFCFFNGEVQAILRRNWNQYKIQFGNSFSNSEALRSAS 
YTVSTISDGPGYSHDCPSEHLNGKSIHDIENVLLKPENLYN 

SEQ ID NO: 27 

gi|9055257|ref|NM_018782.1| Mus musculus calcitonin receptor-like (Calcrl), mRNA 

1 tgagtctgga gacaattgtg tatgtatact tttcttaaga tattaaaaaa caaatccaag 
61 gtcacaggtt gcttattgat agaagagaaa caatacggat agaagagaaa tcagaaaatt 
121 gcttatgatt gacaagaaca gctgcagcag ctacctagct tgaacataca gcacatttca 
181 tttggactct aataatggat aaaaagcata tactatgttt tctggttctc ttgcctctta 

241 atatggctct catctcagca gagtcggaag aaggcgtgaa ccaaacagac ttgggagtca 
301 ctagaaacaa gatcatgacg gctcaatatg aatgttacca gaagatcatg caggacccca 
361 ttcaacaagc agaaggcctt tactgcaata ggacctggga cggatggcta tgctggaatg 
421 acgttgcagc agggacggaa tcaatgcagt actgccctga ctattttcag gattttgatc 
481 cttcagagaa ggttacaaag atctgtgacc aagatggaca ctggtttcgg catccggata 

541 gtaatagaac atggaccaac tacaccctgt gtaataacag cacgcatgag aaagtgaaga 
601 cagccctgaa tctgttctac ctaactataa ttggacatgg attatctatt gcatctctga 
661 tcatctctct catcatattt ttttacttca agagcctaag ttgccaacgg atcacattgc 
721 ataaaaacct gttcttttca tttatttgta attcgattgt aacaatcatc cacctcacgg 
781 cagtggccaa taaccaggcc ttagtggcca caaatcctgt gagctgcaaa gtgtctcagt 

841 ttatccatct ctacctgatg ggctgtaact acttctggat gctctgtgaa ggcgtttacc 
901 tgcacacact catcgtggtg gctgtgtttg cggagaagca gcacttgatg tggtattatt 
961 ttctcggctg ggggtttcct ctgcttcctg cctgcatcca cgccattgcc agaagcttgt 
1021 attacaacga caattgctgg atcagctcag acactcatct cctctacatt atccatggtc 
1081 cgatttgtgc tgctttgttg gtaaatctct ttttcctatt aaatattgta cgtgttctca 
1141 tcaccaagtt gaaagttaca caccaagtgg aatccaatct ctacatgaaa gccgtaagag 
1201 ctactctcat cttggtacca ctacttggca ttgaatttgt gctttttccg tggcggcctg 
1261 aaggaaaggt tgcagaggag gtgtatgact atgtcatgca cattttgatg cactttcagg 
1321 gtcttttggt ggctactatt ttctgcttct ttaatggaga ggttcaagca attctgagaa 
1381 gaaattggaa ccagtataaa atccaatttg gaaatggctt ttcccactct gatgctctcc 
1441 gcagtgcatc ctacacagtg tcaacaatca gtgacatgca agggtacagc catgactgcc 
1501 ccactgaaca cttaaatgga aaaagcatcc aggatattga aaatgttgcc ttaaaatcag 
1561 aaaatacgta tgatctagtg atgtgaaaat atataaaatc acacacttga acctatgatt 

1621 ttatagccaa aagattggat agccagcaaa tggatttctg gaataccatg aagaaagccc 
1681 tcaaatgaaa tgggaattgt atggataaat gtttaacaac cctctctcta tgggggagaa 
1741 aagcctcaat ttatattgtt ggccagtaac tactcctacc ataactgatt tcaagttacc 
1801 aacctgacat cactgaatgt gtaattggaa agaaaataag cacaaacaac tcccaggagc 
1861 tgacatgttc tggaacctgc acagcactgc atgccctcag agatggaagt ctgtgcacac 

1921 tctcacccac actgaccaga actctgctct tctcatctga ggagacctca ctcggactta 
1981 cagacatgaa aggaaagttt ggtttttgtt ataaaactcc tcatcctttt gtcatttggt 
2041 gatagtgaat ggtttgtcca gaaacacttt aaccctcttt ataacttctc ttgtagacta 
2101 gacaaagagt tcattatctc tgtggtctca tgttactttc tttaagaaaa gcaactgagc 
2161 agaattccca tgggcttctt agttgctgct acatattgtt gtatcctgtg gtatatgcac 

2221 tctgaatcac tagggagaca tattggtaga agctacacct tgtcggctct ctctctctct 
2281 ctctctctct ctctctctct ctctctctct ctctctcggc aagaatagga ggaaatcagt 
2341 tttctggaat ctgtaaatag aaactttggc ctttccattt ctactgtgta gataagtaag 
2401 tgatcaacca ttttacatga agggaatcaa tgaaggattt ctttttatct tgggaatctg 
2461 agagagagag agagagagag agagagagag agagagagag agagagagag agagagagag 

2521 aggttgtaca aatgaaactg taagtactct gtaattttat tttattgttt caattcaaaa 
2581 cacaacctag gtaactttta aagcagatag atgatgcaac attatatgca acttagtatc 
2641 tgatactgtg tctgggctga tttattttat agtaaaatag aatcatgaat tctatacttg 
2701 gtaaatattt taaggacaaa cagataccag catcagaagt ttgagaacta aaaacaaacc 
2761 ccagaaatat caatgataag atatgaaata tttatttaaa aatgtaaagc cataatttta 

2821 caagcatgat atatttggtg tgctgacata ctgcttccaa tatgacatat tccaatttga 
2881 tgacaactca ttcaaattca gaaattatga acaactactg taagaagtgc tagtctgcta 
2941 catttgtggg ctttaattca ctaaattata gctttactga tttttactaa atgaagatac 
3001 atgcttttga aaaagaccca ggttattatt tttttatgaa ccacagagga attttctcta 
3061 tgtaataaca ctgtttcagt atttcagtag actactgtgg atgataatac aagccacggg 

3121 aagacttgaa cttaccactc attgaggatg aagtgtcaga tatttaagta ataattaaat 
3181 taagacagaa gaagattttg cttgtgagtt taagtgacat tttaaatgga tttagttttt 
3241 ccaatattgc ttggtttgga gtttctgaag aacaagattg acttcagtta gtaaaagtca 
3301 ctctggggac agttacatga gtccatgttt tttaaagtgt gtctgtcagt ttacagtgga 
3361 tgagaggagt gactcagccc tcactcattg tcttaagcta tagaagagaa taaattaaaa 

3421 agttgtttga aactgtcaag gaaaagaatt caaattatca agcactgatg tattacatga 
3481 tctgacagac atatttgtaa tttctttttt tttttttctt tctttttttt tatatttgta 
3541 attttctgaa ctttcagctg cttaaaccgt acaaaccttc cttttaatac cttcagactt 
3601 taggtattga ttattcatat cttcgctgac agtgttcgtg taattctgca ttcttctaaa 
3661 caaactgtca agtattaaca ataaatggta gtacagaatg ccgaattacc agagaacata 

3721 cttaaaaggt attgtaattt taagttttat tattttttta ctataccgag tgtagctgag 
3781 atttttttac tgccaatatc tactgtggtg tcattctcag ccatagcagt ctaagattag 
3841 tcagtattag ttttgtaata gtatttttga agctggattt ctctaaagct gaagtgccaa 
3901 ttaaaatatt tctataatgc ttgtagaaaa aaaatccaaa aattttgaaa gaaaacttct 
3961 ctggtgccta aaaattaact tggggagttt ttagctttta aaagttgcat ctattatttt 

4021 tctctaatac tttgttagag tcatggctga cacttttaga agcgactttg tatttttttg 
4081 ttgataaacc agccttcctg gaggttagtg ttttttcaaa tataacaaac tgtgtagggt 
4141 ttaggagtag tcagtgttat acttactgtt tagatatttt aatgtgaatc tattttaatt 
4201 tcctttgtgt tttaatttgc tcatagaaac attatatgta aaacatactc aagttcactt 
4261 aaaagaactc atattttatc gacataaaat ctctacttag gttttatgag aatattaaag 

4321 ttgtaagtta tcactcactt tcacgcatcc tccctggact atagagcaat tgaattcctc 
4381 acaaaattct tgcagcaaat aaagaaatac tttgcaaaat aaaaaaaaaa aaaaaaaaaa 
4441 aaaaaaa 

SEQ ID NO: 28 
Amino acid sequence of mouse CALCRL encoded by the DNA sequence shown in SEQ ID NO: 27. 

MDKKHILCFLVLLPLNMALISAESEEGVNQTDLGVTRNKIMTAQYECYQKIMQDPIQQAE 
GLYCNRTWDGWLCWNDVAAGTESMQYCPDYFQDFDPSEKVTKICDQDGHWFRHPDSNRTW 
TNYTLCNNSTHEKVKTALNLFYLTIIGHGLSIASLIISLIIFFYFKSLSCQRITLHKNLF 
FSFICNSIVTIIHLTAVANNQALVATNPVSCKVSQFIHLYLMGCNYFWMLCEGVYLHTLI 
VVAVFAEKQHLMWYYFLGWGFPLLPACIHAIARSLYYNDNCWISSDTHLLYIIHGPICAA 
LLVNLFFLLNIVRVLITKLKVTHQVESNLYMKAVRATLILVPLLGIEFVLFPWRPEGKVA 
EEVYDYVMHILMHFQGLLVATIFCFFNGEVQAILRRNWNQYKIQFGNGFSHSDALRSASY 
TVSTISDMQGYSHDCPTEHLNGKSIQDIENVALKSENTYDLVM 

SEQ ID NO: 29 

gi|6978586|ref|NM_012717.1| Rattus norvegicus calcitonin receptor-like (Calcrl), mRNA 

1 gggaactctt ttctactatc tcagaaaatc aaattccctc ctgagactat tcacacagaa 
61 tttccttagg agccgtgcgg ggccctatga tgatacacca acatcttgtg ctaccaactg 
121 tcaaaacttt ggatggtcta acactcaggt catgaccctt gtaattaaga aattctcgaa 
181 gacaatattg agcatgattc aggagaaaat gtgatttgag tctggagaca attgtgcatg 
241 tatatctttt cttaagatct taaaaagaaa gagaaaacaa atccagggtt gtcggtcact 
301 gattgataga agagaaaaca atattcagac gattgcttat gattgacaag agcagctgcg 
361 gccgctaccc agcttgaaca tatcgcacac ttcagttgga ctctaatgat ggataaaaag 
421 tgtacactat gttttctgtt tctcttgctt cttaatatgg ctctcatcgc agcagagtcg 
481 gaagaaggcg cgaaccaaac agacttggga gtcactagga acaagatcat gacggctcag 
541 tatgaatgtt accaaaagat catgcaggat cccattcaac aaggagaagg cctttactgc 
601 aacagaacct gggacggatg gctatgctgg aatgacgttg cagcaggaac cgagtcaatg 
661 cagtactgcc ctgattactt tcaagatttt gatccttcag agaaggttac aaagatctgt 
721 gaccaagatg gaaactggtt cagacatcca gatagtaaca ggacatggac aaactacacc 
781 ttgtgtaaca acagcacgca tgagaaagtg aagacagcac tgaatttgtt ctacctaact 
841 ataattggac atggattatc tattgcctct ctgatcatct cactcatcat atttttttat 
901 ttcaagagcc taagttgcca acggattaca ttgcataaaa acctgttctt ttcatttgtt 
961 tgtaattcga ttgtgacaat cattcacctc acggcagtgg ccaataacca ggccttagtg 
1021 gccacaaatc ctgtgagctg caaggtgtcc cagttcattc atctttacct gatgggctgt 
1081 aactactttt ggatgctctg tgaaggcatt tacctgcaca cactcattgt ggtggctgtg 
1141 tttgcagaga agcagcactt gatgtggtat tattttcttg gctgggggtt tcctctgctt 
1201 cctgcctgca tccatgccat cgccagaagc ttgtattaca atgacaactg ctggatcagc 
1261 tcagacactc atctcctcta catcatccat ggtcccattt gtgctgcttt actggtaaat 
1321 ctctttttcc tattaaatat tgtacgtgtt ctcatcacca agttgaaagt tacacaccaa 
1381 gcagaatcca atctctacat gaaagctgta agagccactc tcatcttggt accactactt 
1441 ggcattgaat ttgtgctttt tccatggcgg cctgaaggaa aggttgctga ggaggtgtat 
1501 gactatgtca tgcacattct catgcactat cagggtcttt tggtgtctac aattttctgc 
1561 ttctttaacg gagaggttca agcaattctg agaagaaatt ggaaccagta taaaatccaa 
1621 tttggcaatg gcttttccca ctctgatgct ctccgcagcg catcctatac ggtgtcaaca 
1681 atcagcgatg tgcaggggta cagccacgac tgccccactg aacacttaaa tggaaaaagc 
1741 atccaggata ttgaaaatgt tgccttaaaa ccagaaaaaa tgtatgatct agtgatgtga 
1801 aaatacagaa atctcacttg ttttgtgtca cttgaaccta tgattttata gtcggaagat 
1861 ttaatagcaa atgggtttct tgaataccat gaagaaagcc ctcaaatgaa atgagaattg 
1921 tgtggataaa tgtttaacga ccctctctct atgggggaga aaagcctcaa tttttattgt 
1981 tggccagtaa atactcctac catagctgat ttcaagttac caacctgaca tcaccgaatg 
2041 tggaattgga aagaaaataa gcacaatcaa ctcttgggag ctgacatgcg tcgtc 



SEQ ID NO: 30 

Amino acid sequence of rat CALCRL encoded by the DNA sequence shown in SEQ ID NO: 29. 
MMDKKCTLCFLFLLLLNMALIAAESEEGANQTDLGVTRNKIMTAQYECYQKIMQDPIQQG 
EGLYCNRTWDGWLCWNDVAAGTESMQYCPDYFQDFDPSEKVTKICDQDGNWFRHPDSNRT 
WTNYTLCNNSTHEKVKTALNLFYLTIIGHGLSIASLIISLIIFFYFKSLSCQRITLHKNL 
FFSFVCNSIVTIIHLTAVANNQALVATNPVSCKVSQFIHLYLMGCNYFWMLCEGIYLHTL 
IVVAVFAEKQHLMWYYFLGWGFPLLPACIHAIARSLYYNDNCWISSDTHLLYIIHGPICA 
ALLVNLFFLLNIVRVLITKLKVTHQAESNLYMKAVRATLILVPLLGIEFVLFPWRPEGKV 

AEEVYDYVMHILMHYQGLLVSTIFCFFNGEVQAILRRNWNQYKIQFGNGFSHSDALRSAS 
YTVSTISDVQGYSHDCPTEHLNGKSIQDIENVALKPEKMYDLVM 

SEQ ID NO: 31 

gi|22538799|ref|NM_005408.2| Homo sapiens chemokine (C-C motif) ligand 13 (CCL13), mRNA 

1 aaaaggccgg cggaacagcc agaggagcag agaggcaaag aaacattgtg aaatctccaa 
61 ctcttaacct tcaacatgaa agtctctgca gtgcttctgt gcctgctgct catgacagca 
121 gctttcaacc cccagggact tgctcagcca gatgcactca acgtcccatc tacttgctgc 

181 ttcacattta gcagtaagaa gatctccttg cagaggctga agagctatgt gatcaccacc 
241 agcaggtgtc cccagaaggc tgtcatcttc agaaccaaac tgggcaagga gatctgtgct 
301 gacccaaagg agaagtgggt ccagaattat atgaaacacc tgggccggaa agctcacacc 
361 ctgaagactt gaactctgct acccctactg aaatcaagct ggagtacgtg aaatgacttt 
421 tccattctcc tctggcctcc tcttctatgc tttggaatac ttctaccata attttcaaat 

481 aggatgcatt cggttttgtg attcaaaatg tactatgtgt taagtaatat tggctattat 
541 ttgacttgtt gctggtttgg agtttatttg agtattgctg atcttttcta aagcaaggct 
601 ttgagcaagt aggttgctgt ctctaagccc ccttcccttc cactatgagc tgctggcagt 
661 gggtttgtat tcggttccca ggggttgaga gcatgcctgt gggagtcatg gacatgaagg 
721 gatgctgcaa tgtaggaagg agagctcttt gtgaatgtga ggtgttgcta aatatgttat 

781 tgtggaaaga tgaatgcaat agtaggactg ctgacatttt gcagaaaata cattttattt 
841 aaaatctcct aaaaaaaaaa a 

SEQ ID NO: 32 

Amino acid sequence of human CCL13 encoded by the DNA sequence shown in SEQ ID NO: 31. 

MKVSAVLLCLLLMTAAFNPQGLAQPDALNVPSTCCFTFSSKKISLQRLKSYVITTSRCPQ 
KAVIFRTKLGKEICADPKEKWVQNYMKHLGRKAHTLKT 

SEQ ID NO: 33 

Amino acid sequence of human CCL13, a soluble active secreted form derived from SEQ ID 
NO:32. 

QPDALNVPSTCCFTFSSKKISLQRLKSYVITTSRCPQKAVIFRTKLGKEICADPKEKWVQ 
NYMKHLGRKAHTLKT 

SEQ ID NO: 34 

gi|20381461|gb|BC027520.1| Mus musculus chemokine (C-C motif) ligand 12, mRNA 
(cDNA clone MGC:41146 IMAGE:1548072), complete cds 

1 ttgacctcaa catgaagatt tccacacttc tatgcctcct gctcatagct accaccatca 

61 gtcctcaggt attggctgga ccagatgcgg tgagcacccc agtcacgtgc tgttataatg 
121 ttgttaagca gaagattcac gtccggaagc tgaagagcta caggagaatc acaagcagcc 
181 agtgtccccg ggaagctgtg atcttcagga ccatactgga taaggagatc tgtgctgacc 
241 ccaaggagaa gtgggttaag aattccataa accacttgga taagacgtct caaaccttca 
301 tccttgaacc ttcatgtcta ggctgagagt tccaaaaact cttacgtatt tccccctgaa 

361 gttccccacg ggcagtgtga tatttattat gatatctaaa aagagatgtt tttaataatt 

421 taaacaaact tgcttaaata atatttaatg gtatttaagt aatatttggg ccaattaaac 

481 cgaatctaat ttaaaaaaaa aaaaaaaaaa aaaaaa 

SEQ ID NO: 35 

Amino acid sequence of mouse CCL13 encoded by the DNA sequence shown in SEQ ID 
NO: 34. 

MKISTLLCLLLIATTISPQVLAGPDAVSTPVTCCYNVVKQKIHVRKLKSYRRITSSQCPR 
EAVIFRTILDKEICADPKEKWVKNSINHLDKTSQTFILEPSCLG 

SEQ ID NO: 36 

gi|27674224|ref|XM_213425.1| Rattus norvegicus similar to small inducible cytokine A12 
precursor (LOC287562), mRNA 

1 ggctcctgag tcctccagct ctcattccaa agcctttggc ctcaacatga agatctccac 
61 ccttctttgc cttctgctca tagctgccgc catcagccca caggtgttgg ctggaccaga 
121 ttcagtgttc accccagtca cctgctgtta taatgtcgct aagcagaaga tccacattcg 
181 gaggctaaag agctacagga aaatcacaag cagccagtgt ccccgggaag ctgtgatctt 
241 cagaactgta ctggataagg agctctgtgc tgaccccaag gagaagtggg ttaaggactc 
301 catgaaccac ttggatcaga agtctcgaac tcagcatcct tgaaccttca cgtctaggct 
361 gaaagttcca gaaaaattac atttatttcc tctgaccttc cccatggaca gtgcgatagt 
421 tatttattat gatatctaaa gagagatgct tttaataatt taaaacacaa acttactgaa 
481 gtaatattta atgatatcta agttatattt gggccaatta aactgacttt aattt 



SEQ ID NO: 37 

Amino acid sequence of rat CCL13 encoded by the DNA sequence shown in SEQ ID NO: 36. 

MKISTLLCLLLIAAAISPQVLAGPDSVPTPVTCCYNVAKQKIHIRRLKSYRKITSSQCPR 
EAVIFRTVLDKELCADPKEKWVKDSMNHLDQKSRTQHP 

SEQ ID NO: 38 

gi|22538815|ref|NM_005623.2| Homo sapiens chemokine (C-C motif) ligand 8 (CCL8), 
mRNA 

1 gtgatggaga gcaccagcaa agccttaggg cccatccctg gcctcctgtt acccacagag 

61 gggtaggccc ttggctctct tccactatga cgtcagcttc cattcttcct ttcttataga 
121 caattttcca tttcaaggaa atcagagccc ttaatagttc agtgaggtca ctttgctgag 
181 cacaatccca tacccttcag cctctgctcc acagagccta agcaaaagat agaaactcac 
241 aacttccttg ttttgttatc tggaaattat cccaggatct ggtgcttact cagcatattc 

301 aaggaaggtc ttacttcatt cttccttgat tgtgaccatg cccaggctct ctgctcccta 
361 taaaaggcag gcagagccac cgaggagcag agaggttgag aacaacccag aaaccttcac 
421 ctctcatgct gaagctcaca cccttgccct ccaagatgaa ggtttctgca gcgcttctgt 
481 gcctgctgct catggcagcc actttcagcc ctcagggact tgctcagcca gattcagttt 
541 ccattccaat cacctgctgc tttaacgtga tcaataggaa aattcctatc cagaggctgg 

601 agagctacac aagaatcacc aacatccaat gtcccaagga agctgtgatc ttcaagacca 
661 aacggggcaa ggaggtctgt gctgacccca aggagagatg ggtcagggat tccatgaagc 
721 atctggacca aatatttcaa aatctgaagc catgagcctt catacatgga ctgagagtca 
781 gagcttgaag aaaagcttat ttattttccc caacctcccc caggtgcagt gtgacattat 
841 tttattataa catccacaaa gagattattt ttaaataatt taaagcataa tatttcttaa 
901 aaagtattta attatattta agttgttgat gttttaactc tatctgtcat acatcctagt 

961 gaatgtaaaa tgcaaaatcc tggtgatgtg ttttttgttt ttgttttcct gtgagctcaa 

1021 ctaagttcac ggcaaaatgt cattgttctc cctcctacct gtctgtagtg ttgtggggtc 

1081 ctcccatgga tcatcaaggt gaaacacttt ggtattcttt ggcaatcagt gctcctgtaa 

1141 gtcaaatgtg tgctttgtac tgctgttgtt gaaattgatg ttactgtata taactatgga 

1201 attttgaaaa aaaatttcaa aaagaaaaaa atatatataa tttaaaacta aaaaaaaaaa 

1261 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 

1321 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 

SEQ ID NO: 39 

Amino acid sequence of human CCL8 encoded by the DNA sequence shown in SEQ ID NO: 38. 

MKVSAALLCLLLMAATFSPQGLAQPDSVSIPITCCFNVINRKIPIQRLESYTRITNIQCP 
KEAVIFKTKRGKEVCADPKERWVRDSMKHLDQIFQNLKP 

SEQ ID NO: 40 

Amino acid sequence of human CCL8, a soluble active secreted form derived from SEQ ID NO: 39. 

QPDSVSIPITCCFNVINRKIPIQRLESYTRITNIQCPKEAVIFKTKRGKEVCADPKERWV 
RDSMKHLDQIFQNLKP 

SEQ ID NO: 41 

gi|20381461|gb|BC027520.1| Mus musculus chemokine (C-C motif) ligand 12, mRNA 
(cDNA clone MGC:41146 IMAGE:1548072), complete cds 

1 ttgacctcaa catgaagatt tccacacttc tatgcctcct gctcatagct accaccatca 

61 gtcctcaggt attggctgga ccagatgcgg tgagcacccc agtcacgtgc tgttataatg 

121 ttgttaagca gaagattcac gtccggaagc tgaagagcta caggagaatc acaagcagcc 

181 agtgtccccg ggaagctgtg atcttcagga ccatactgga taaggagatc tgtgctgacc 

241 ccaaggagaa gtgggttaag aattccataa accacttgga taagacgtct caaaccttca 

301 tccttgaacc ttcatgtcta ggctgagagt tccaaaaact cttacgtatt tccccctgaa 

361 gttccccacg ggcagtgtga tatttattat gatatctaaa aagagatgtt tttaataatt 

421 taaacaaact tgcttaaata atatttaatg gtatttaagt aatatttggg ccaattaaac 

481 cgaatctaat ttaaaaaaaa aaaaaaaaaa aaaaaa 

SEQ ID NO: 42 

Amino acid sequence of mouse CCL8 encoded by the DNA sequence shown in SEQ ID NO: 
41. 

MKISTLLCLLLIATTISPQVLAGPDAVSTPVTCCYNVVKQKIHVRKLKSYRRITSSQCPR 
EAVIFRTILDKEICADPKEKWVKNSINHLDKTSQTFILEPSCLG 

SEQ ID NO: 43 

gi|27674224|ref|XM_213425.1| Rattus norvegicus similar to small inducible cytokine A12 
precursor (LOC287562), mRNA 

1 ggctcctgag tcctccagct ctcattccaa agcctttggc ctcaacatga agatctccac 
61 ccttctttgc cttctgctca tagctgccgc catcagccca caggtgttgg ctggaccaga 
121 ttcagtgttc accccagtca cctgctgtta taatgtcgct aagcagaaga tccacattcg 
181 gaggctaaag agctacagga aaatcacaag cagccagtgt ccccgggaag ctgtgatctt 
241 cagaactgta ctggataagg agctctgtgc tgaccccaag gagaagtggg ttaaggactc 
301 catgaaccac ttggatcaga agtctcgaac tcagcatcct tgaaccttca cgtctaggct 
361 gaaagttcca gaaaaattac atttatttcc tctgaccttc cccatggaca gtgcgatagt 
421 tatttattat gatatctaaa gagagatgct tttaataatt taaaacacaa acttactgaa 
481 gtaatattta atgatatcta agttatattt gggccaatta aactgacttt aattt 



SEQ ID NO: 44 

Amino acid sequence of rat CCL8 encoded by the DNA sequence shown in SEQ ID NO: 43. 

MKISTLLCLLLIAAAISPQVLAGPDSVPTPVTCCYNVAKQKIHIRRLKSYRKITSSQCPR 
EAVIFRTVLDKELCADPKEKWVKDSMNHLDQKSRTQHP 

SEQ ID NO: 45 

gi|4557017|ref|NM_001276.1| Homo sapiens chitinase 3-like 1 (cartilage glycoprotein-39) 
(CHI3L1), mRNA 

1 agtggagtgg gacaggtata taaaggaagt acagggcctg gggaagaggc cctgtctagg 
61 tagctggcac caggagccgt gggcaaggga agaggccaca ccctgccctg ctctgctgca 
121 gccagaatgg gtgtgaaggc gtctcaaaca ggctttgtgg tcctggtgct gctccagtgc 
181 tgctctgcat acaaactggt ctgctactac accagctggt cccagtaccg ggaaggcgat 
241 gggagctgct tcccagatgc ccttgaccgc ttcctctgta cccacatcat ctacagcttt 
301 gccaatataa gcaacgatca catcgacacc tgggagtgga atgatgtgac gctctacggc 
361 atgctcaaca cactcaagaa caggaacccc aacctgaaga ctctcttgtc tgtcggagga 
421 tggaactttg ggtctcaaag attttccaag atagcctcca acacccagag tcgccggact 
481 ttcatcaagt cagtaccgcc attcctgcgc acccatggct ttgatgggct ggaccttgcc 
541 tggctctacc ctggacggag agacaaacag cattttacca ccctaatcaa ggaaatgaag 
601 gccgaattta taaaggaagc ccagccaggg aaaaagcagc tcctgctcag cgcagcactg 
661 tctgcgggga aggtcaccat tgacagcagc tatgacattg ccaagatatc ccaacacctg 
721 gatttcatta gcatcatgac ctacgatttt catggagcct ggcgtgggac cacaggccat 
781 cacagtcccc tgttccgagg tcaggaggat gcaagtcctg acagattcag caacactgac 
841 tatgctgtgg ggtacatgtt gaggctgggg gctcctgcca gtaagctggt gatgggcatc 
901 cccaccttcg ggaggagctt cactctggct tcttctgaga ctggtgttgg agccccaatc 
961 tcaggaccgg gaattccagg ccggttcacc aaggaggcag ggacccttgc ctactatgag 
1021 atctgtgact tcctccgcgg agccacagtc catagaaccc tcggccagca ggtcccctat 
1081 gccaccaagg gcaaccagtg ggtaggatac gacgaccagg aaagcgtcaa aagcaaggtg 
1141 cagtacctga aggataggca gctggcaggc gccatggtat gggccctgga cctggatgac 
1201 ttccagggct ccttctgcgg ccaggatctg cgcttccctc tcaccaatgc catcaaggat 
1261 gcactcgctg caacgtagcc ctctgttctg cacacagcac gggggccaag gatgccccgt 
1321 ccccctctgg ctccagctgg ccgggagcct gatcacctgc cctgctgagt cccaggctga 
1381 gcctcagtct ccctcccttg gggcctatgc agaggtccac aacacacaga tttgagctca 
1441 gccctggtgg gcagagaggt agggatgggg ctgtggggat agtgaggcat cgcaatgtaa 
1501 gactcgggat tagtacacac ttgttgatga ttaatggaaa tgtttacaga tccccaagcc 
1561 tggcaaggga atttcttcaa ctccctgccc cctagccctc cttatcaaag gacaccattt 
1621 tggcaagctc tatcaccaag gagccaaaca tcctacaaga cacagtgacc atactaatta 
1681 taccccctgc aaagccagct tgaaaccttc acttaggaac gtaatcgtgt cccctatcct 
1741 acttcccctt cctaattcca cagctgctca ataaagtaca agagtttaac agtgtgttgg 
1801 cgctttgctt tggtctatct ttgagcgccc actagaccca ctggactcac ctcccccatc 
1861 tcttctgggt tccttcctct gagccttggg acccctgagc ttgcagagat gaaggccgcc 
1921 atgtt 

SEQ ID NO: 46 

Amino acid sequence of human CHI3L1 encoded by the DNA sequence shown in SEQ ID NO: 45. 

MGVKASQTGFVVLVLLQCCSAYKLVCYYTSWSQYREGDGSCFPDALDRFLCTHIIYSFAN 
ISNDHIDTWEWNDVTLYGMLNTLKNRNPNLKTLLSVGGWNFGSQRFSKIASNTQSRRTFI 
KSVPPFLRTHGFDGLDLAWLYPGRRDKQHFTTLIKEMKAEFIKEAQPGKKQLLLSAALSA 
GKVTIDSSYDIAKISQHLDFISIMTYDFHGAWRGTTGHHSPLFRGQEDASPDRFSNTDYA 
VGYMLRLGAPASKLVMGIPTFGRSFTLASSETGVGAPISGPGIPGRFTKEAGTLAYYEIC 
DFLRGATVHRTLGQQVPYATKGNQWVGYDDQESVKSKVQYLKDRQLAGAMVWALDLDDFQ 
GSFCGQDLRFPLTNAIKDALAAT 

10 SEQ ID NO: 47 

Amino acid sequence of human CHI3L1, a soluble active secreted form derived from SEQ ID 
NO:46. 

YKLVCYYTSWSQYREGDGSCFPDALDRFLCTHIIYSFANISNDHIDTWEWNDVTLYGMLN 
TLKNRNPNLKTLLSVGGWNFGSQRFSKIASNTQSRRTFIKSVPPFLRTHGFDGLDLAWLY 
PGRRDKQHFTTLIKEMKAEFIKEAQPGKKQLLLSAALSAGKVTIDSSYDIAKISQHLDFI 
SIMTYDFHGAWRGTTGHHSPLFRGQEDASPDRFSNTDYAVGYMLRLGAPASKLVMGIPTF 
GRSFTLASSETGVGAPISGPGIPGRFTKEAGTLAYYEICDFLRGATVHRTLGQQVPYATK 
GNQWVGYDDQESVKSKVQYLKDRQLAGAMVWALDLDDFQGSFCGQDLRFPLTNAIKDALA 
AT 

SEQ ID NO: 48 

gi|33468846|ref|NM_007695.1| Mus musculus chitinase 3-like 1 (Chi3l1), mRNA 

1 cggacgcgtg ggcggacgcg tggggctggg tactaggaga agccatcatg cacacctcta 
61 ctgaagccag gatgggcatg agggcggcac tgacaggctt tgcggtcctg atgctgctcc 
121 agagctgctc tgcgtacaag ctggtctgct acttcaccag ctggtcccag taccgggaag 
181 gcgttggaag cttcttacca gacgccatcc aacctttcct gtgcacccac atcatctaca 
241 gctttgccaa catcagcagc gacaacatgc ttagcacatg ggagtggaat gacgagtcga 
301 actatgacaa gctgaataaa ctgaagacca gaaacaccaa cctgaagacc ctcctgtctg 
361 ttggagggtg gaaatttggc gaaaaaagat tttccgagat tgcctccaac actgagagac 
421 gcactgcttt cgtccggtcg gtagccccgt tcctgcgttc ttatggcttt gatgggctgg 
481 atctcgcctg gctctaccct cgcttaagag acaagcagta tttctccacc ctgatcaagg 
541 aactgaatgc ggaattcaca aaggaggtcc agccaggcag agagaaactc ctgctcagcg 
601 cagctttgtc agcaggaaag gtggccattg acactggcta tgacatcgcc cagatagccc 
661 aacacctgga ttttatcaat ctcatgacct acgatttcca tggagtctgg cgccaaatca 
721 caggccacca cagccccctc ttccaaggcc agaaggacac taggtttgac agatacagca 
781 atgtgaacta tgccgtgcag tacatgatac gtctgggagc ccaggccagc aagctactga 
841 tgggcatccc cacctttggg aagagcttca ctctggcatc ttctgaaaat cagttgggag 
901 ctccaatctc aggggaagga ttaccaggcc ggttcaccaa ggaggcaggg accctggcct 
961 actacgagat atgcgacttc ctcaaaggag ctgaagtaca tcgactctcc aacgagaagg 
1021 ttcccttcgc taccaagggc aaccagtggg tggggtatga ggacaaggag agtgtcaaaa 
1081 acaaggttgg gttcctgaag gagaagaagc tggcaggagc catggtgtgg gcactggatt 
1141 tggatgattt ccagggcacc tgtcagccga aggaattctt cccgctcacc aacgccatca 
1201 aggatgccct ggcttagctc cccctttccc atatggtacc cccactctct ggccaggagt 
1261 ttaatctctt gcaatgttaa gttccccaac tgagccccag tttctccttc ccttggcacc 
1321 tgtgtaaggg gccacagcag gctcagctat ggagaacagg gaactagggt aggacgatgg 
1381 tggggttgtg agagtcacag tgtgagcaga tacacaaccc tgttaaggaa tgcaaattct 
1441 cagactctaa cctcccttta cccagcctga ccaaaggaca ccacttggat caagtaggca 
1501 aatatcttac aggattgagg gaccatacta attataccct ctgcaaagcc caacttgaat 
1561 ccttccctta ggaacttaat cgtcccactt ccctttccct aattccacag ctgttcaata 
1621 aagcgccaga acctaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 
SEQ ID NO: 49 

Amino acid sequence of mouse CHI3L1 encoded by the DNA sequence shown in SEQ ID 
NO: 48. 

MGMRAALTGFAVLMLLQSCSAYKLVCYFTSWSQYREGVGSFLPDAIQPFLCTHIIYSFAN 
ISSDNMLSTWEWNDESNYDKLNKLKTRNTNLKTLLSVGGWKFGEKRFSEIASNTERRTAF 
VRSVAPFLRSYGFDGLDLAWLYPRLRDKQYFSTLIKELNAEFTKEVQPGREKLLLSAALS 
AGKVAIDTGYDIAQIAQHLDFINLMTYDFHGVWRQITGHHSPLFQGQKDTRFDRYSNVNY 
AVQYMIRLGAQASKLLMGIPTFGKSFTLASSENQLGAPISGEGLPGRFTKEAGTLAYYEI 
CDFLKGAEVHRLSNEKVPFATKGNQWVGYEDKESVKNKVGFLKEKKLAGAMVWALDLDDF 
QGTCQPKEFFPLTNAIKDALA 

SEQ ID NO: 50 

gi|34880227|ref|XM_34n23.1| Rattus norvegicus chitinase 3-like 1 (cartilage glycoprotein-39) 
(Chi3l1), mRNA 

1 gcctgaacag agggctggag ctgcagacag gagctgccgg gaatgctggg agactactgg 
61 caagaagctt tgcggtcctg atgctgctcc agagctgctc tgcgtacaaa ctggtctgct 
121 actacaccaa ctggtcccag taccgggaag gcaatgggag ctgcttccca gatgccctcg 
181 accattccct gtgcacccat atcatctaca gctttgccaa catcagcaac aacaagctca 
241 gcacatcgga gtggaatgac gtaaccctgt atggcatgct gaatactctc aagaccagaa 
301 accccagact gaagacactg ctgtctgttg gaggatggag ctctggctca gaaagatttt 
361 ccaggattgt ctccaacgct aagagtcgca agactttcgt ccagtcggta gctcccttcc 
421 tgcggaccta tggctttgat ggactggatc tcgcctggct ctacccgggc ccgaaagaca 
481 agcaacattt taccacactg atcaaggaac tgaaggcgga attcacaaag gaagtccagc 
541 caggcacaga gaaactcctg ctcagtgctg ccgtgtcagc aggaaaggtg acccttgaca 
601 gtggctatga tgttgcccag atagcccaac acctagattt cattaatctc atgacctatg 
661 atttccatgg aacctggcgc cacaccacag gacatcacag ccccctcttc cgaggccagc 
721 aggacactgg gcctgacaga ttcagcaatg tggactatgg tgtggggtac atgctaaggc 
781 tgggagcccc caccaacaag ctagtgatgg gtatccccac ctttggaaag agcttcactc 
841 tggcatcttc tgagaatcaa gtgggagctc caatcacagg gtcaggatta ccaggccgct 
901 acaccaagga gaaagggacc ctcgcctact acgagatatg cgacttcctc agaggagctg 
961 aagtacatag aattcttggc cagcaggttc cctttgctac caagggcaac cagtgggtgg 
1021 ggtatgatga cccggagagc gtcaaaaaca aggtgaagta cctgaagaac aagcagctgg 
1081 caggagccat ggtgtgggca gtggatttgg atgatttccg gggctccttc tgtgggcata 
1141 acgtacactt cccgctcacc aacgccatca aggaggccct ggctgtggct tagctccccc 
1201 tttcccacat gctcccccca ccctccggcc aggagtctgg tcacttgcaa tgctaagtcc 
1261 cccgactggg tgtcagtttc cctcccttgg aacctgtgta gggggccaca gcaggctcgg 
1321 acctggtgaa cagggaaata gggtaggacg gtgggattgt gaaagtcacc gtgtgagata 
1381 gatacaggtc atccctgtta atgaatgcaa attctcaagg ctcttaactc cctttcccct 
1441 atctctcccg accaaaggac accaattttg gcaagttata gcatcaagta ggcacacatc 
1501 ttacaggatt cagggaccat actaattata ccttctgcaa agcccaactt gaaaccttcc 
1561 cttaggaact taatcgtctc tctgtcccac ttccctttcc ctaattccac agctgctcaa 
1621 taaagtgcca gagcttaaca gtg 

SEQ ID NO: 51 

Amino acid sequence of rat CHI3L1 encoded by the DNA sequence shown in SEQ ID NO: 
50. 

MLLQSCSAYKLVCYYTNWSQYREGNGSCFPDALDHSLCTHIIYSFANISNNKLSTSEWND 
VTLYGMLNTLKTRNPRLKTLLSVGGWSFGSERFSRIVSNAKSRKTFVQSVAPFLRTYGFD 
GLDLAWLYPGPKDKQHFTTLIKELKAEFTKEVQPGTEKLLLSAAVSAGKVTLDSGYDVAQ 
IAQHLDFINLMTYDFHGTWRHTTGHHSPLFRGQQDTGPDRFSNVDYGVGYMLRLGAPTNK 
LVMGIPTFGKSFTLASSENQVGAPITGSGLPGRYTKEKGTLAYYEICDFLRGAEVHRILG 
QQVPFATKGNQWVGYDDPESVKNKVKYLKNKQLAGAMVWAVDLDDFRGSFCGHNVHFPLT 
NAIKEALAVA 

SEQ ID NO: 52 

gi|21536275|ref|NM_000651.3| Homo sapiens complement component (3b/4b) receptor 1, 
including Knops blood group system (CR1), transcript variant S, mRNA 

1 acactctggg cgcggagcac aatgattggt cactcctatt ttcgctgagc ttttcctctt 
61 atttcagttt tcttcgagat caaatctggt ttgtagatgt gcttggggag aatgggggcc 
121 tcttctccaa gaagcccgga gcctgtcggg ccgccggcgc ccggtctccc cttctgctgc 
181 ggaggatccc tgctggcggt tgtggtgctg cttgcgctgc cggtggcctg gggtcaatgc 
241 aatgccccag aatggcttcc atttgccagg cctaccaacc taactgatga atttgagttt 
301 cccattggga catatctgaa ctatgaatgc cgccctggtt attccggaag accgttttct 
361 atcatctgcc taaaaaactc agtctggact ggtgctaagg acaggtgcag acgtaaatca 
421 tgtcgtaatc ctccagatcc tgtgaatggc atggtgcatg tgatcaaagg catccagttc 
481 ggatcccaaa ttaaatattc ttgtactaaa ggataccgac tcattggttc ctcgtctgcc 
541 acatgcatca tctcaggtga tactgtcatt tgggataatg aaacacctat ttgtgacaga 
601 attccttgtg ggctaccccc caccatcacc aatggagatt tcattagcac caacagagag 
661 aattttcact atggatcagt ggtgacctac cgctgcaatc ctggaagcgg agggagaaag 
721 gtgtttgagc ttgtgggtga gccctccata tactgcacca gcaatgacga tcaagtgggc 
781 atctggagcg gccccgcccc tcagtgcatt atacctaaca aatgcacgcc tccaaatgtg 
841 gaaaatggaa tattggtatc tgacaacaga agcttatttt ccttaaatga agttgtggag 
901 tttaggtgtc agcctggctt tgtcatgaaa ggaccccgcc gtgtgaagtg ccaggccctg 
961 aacaaatggg agccggagct accaagctgc tccagggtat gtcagccacc tccagatgtc 
1021 ctgcatgctg agcgtaccca aagggacaag gacaactttt cacctgggca ggaagtgttc 
1081 tacagctgtg agcccggcta cgacctcaga ggggctgcgt ctatgcgctg cacaccccag 
1141 ggagactgga gccctgcagc ccccacatgt gaagtgaaat cctgtgatga cttcatgggc 
1201 caacttctta atggccgtgt gctatttcca gtaaatctcc agcttggagc aaaagtggat 
1261 tttgtttgtg atgaaggatt tcaattaaaa ggcagctctg ctagttactg tgtcttggct 
1321 ggaatggaaa gcctttggaa tagcagtgtt ccagtgtgtg aacaaatctt ttgtccaagt 
1381 cctccagtta ttcctaatgg gagacacaca ggaaaacctc tggaagtctt tccctttgga 
1441 aaagcagtaa attacacatg cgacccccac ccagacagag ggacgagctt cgacctcatt 
1501 ggagagagca ccatccgctg cacaagtgac cctcaaggga atggggtttg gagcagccct 
1561 gcccctcgct gtggaattct gggtcactgt caagccccag atcattttct gtttgccaag 
1621 ttgaaaaccc aaaccaatgc atctgacttt cccattggga catctttaaa gtacgaatgc 
1681 cgtcctgagt actacgggag gccattctct atcacatgtc tagataacct ggtctggtca 
1741 agtcccaaag atgtctgtaa acgtaaatca tgtaaaactc ctccagatcc agtgaatggc
1801 atggtgcatg tgatcacaga catccaggtt ggatccagaa tcaactattc ttgtactaca 
1861 gggcaccgac tcattggtca ctcatctgct gaatgtatcc tctcgggcaa tgctgcccat 
1921 tggagcacga agccgccaat ttgtcaacga attccttgtg ggctaccccc caccatcgcc 
1981 aatggagatt tcattagcac caacagagag aattttcact atggatcagt ggtgacctac 
2041 cgctgcaatc ctggaagcgg agggagaaag gtgtttgagc ttgtgggtga gccctccata
2101 tactgcacca gcaatgacga tcaagtgggc atctggagcg gcccggcccc tcagtgcatt 
2161 atacctaaca aatgcacgcc tccaaatgtg gaaaatggaa tattggtatc tgacaacaga 
2221 agcttatttt ccttaaatga agttgtggag tttaggtgtc agcctggctt tgtcatgaaa 
2281 ggaccccgcc gtgtgaagtg ccaggccctg aacaaatggg agccggagct accaagctgc 
2341 tccagggtat gtcagccacc tccagatgtc ctgcatgctg agcgtaccca aagggacaag
2401 gacaactttt cacccgggca ggaagtgttc tacagctgtg agcccggcta cgacctcaga 
2461 ggggctgcgt ctatgcgctg cacaccccag ggagactgga gccctgcagc ccccacatgt 
2521 gaagtgaaat cctgtgatga cttcatgggc caacttctta atggccgtgt gctatttcca 
2581 gtaaatctcc agcttggagc aaaagtggat tttgtttgtg atgaaggatt tcaattaaaa 
2641 ggcagctctg ctagttactg tgtcttggct ggaatggaaa gcctttggaa tagcagtgtt
2701 ccagtgtgtg aacaaatctt ttgtccaagt cctccagtta ttcctaatgg gagacacaca
2761 ggaaaacctc tggaagtctt tccctttggg aaaacagtaa attacacatg cgacccccac 
2821 ccagacagag ggacgagctt cgacctcatt ggagagagca ccatccgctg cacaagtgac 
2881 cctcaaggga atggggtttg gagcagccct gcccctcgct gtggaattct gggtcactgt 
2941 caagccccag atcattttct gtttgccaag ttgaaaaccc aaaccaatgc atctgacttt
3001 cccattggga catctttaaa gtacgaatgc cgtcctgagt actacgggag gccattctct 
3061 atcacatgtc tagataacct ggtctggtca agtcccaaag atgtctgtaa acgtaaatca 
3121 tgtaaaactc ctccagatcc agtgaatggc atggtgcatg tgatcacaga catccaggtt
3181 ggatccagaa tcaactattc ttgtactaca gggcaccgac tcattggtca ctcatctgct
3241 gaatgtatcc tctcgggcaa tgctgcccat tggagcacga agccgccaat ttgtcaacga
3301 attccttgtg ggctaccccc caccatcgcc aatggagatt tcattagcac caacagagag
3361 aattttcact atggatcagt ggtgacctac cgctgcaatc ctggaagcgg agggagaaag
3421 gtgtttgagc ttgtgggtga gccctccata tactgcacca gcaatgacga tcaagtgggc
3481 atctggagcg gcccggcccc tcagtgcatt atacctaaca aatgcacgcc tccaaatgtg
3541 gaaaatggaa tattggtatc tgacaacaga agcttatttt ccttaaatga agttgtggag
3601 tttaggtgtc agcctggctt tgtcatgaaa ggaccccgcc gtgtgaagtg ccaggccctg
3661 aacaaatggg agccggagct accaagctgc tccagggtat gtcagccacc tccagatgtc
3721 ctgcatgctg agcgtaccca aagggacaag gacaactttt cacccgggca ggaagtgttc
3781 tacagctgtg agcccggcta tgacctcaga ggggctgcgt ctatgcgctg cacaccccag
3841 ggagactgga gccctgcagc ccccacatgt gaagtgaaat cctgtgatga cttcatgggc
3901 caacttctta atggccgtgt gctatttcca gtaaatctcc agcttggagc aaaagtggat
3961 tttgtttgtg atgaaggatt tcaattaaaa ggcagctctg ctagttattg tgtcttggct
4021 ggaatggaaa gcctttggaa tagcagtgtt ccagtgtgtg aacaaatctt ttgtccaagt
4081 cctccagtta ttcctaatgg gagacacaca ggaaaacctc tggaagtctt tccctttgga
4141 aaagcagtaa attacacatg cgacccccac ccagacagag ggacgagctt cgacctcatt
4201 ggagagagca ccatccgctg cacaagtgac cctcaaggga atggggtttg gagcagccct
4261 gcccctcgct gtggaattct gggtcactgt caagccccag atcattttct gtttgccaag
4321 ttgaaaaccc aaaccaatgc atctgacttt cccattggga catctttaaa gtacgaatgc
4381 cgtcctgagt actacgggag gccattctct atcacatgtc tagataacct ggtctggtca
4441 agtcccaaag atgtctgtaa acgtaaatca tgtaaaactc ctccagatcc agtgaatggc
4501 atggtgcatg tgatcacaga catccaggtt ggatccagaa tcaactattc ttgtactaca
4561 gggcaccgac tcattggtca ctcatctgct gaatgtatcc tctcaggcaa tactgcccat
4621 tggagcacga agccgccaat ttgtcaacga attccttgtg ggctaccccc aaccatcgcc
4681 aatggagatt tcattagcac caacagagag aattttcact atggatcagt ggtgacctac
4741 cgctgcaatc ttggaagcag agggagaaag gtgtttgagc ttgtgggtga gccctccata
4801 tactgcacca gcaatgacga tcaagtgggc atctggagcg gccccgcccc tcagtgcatt
4861 atacctaaca aatgcacgcc tccaaatgtg gaaaatggaa tattggtatc tgacaacaga
4921 agcttatttt ccttaaatga agttgtggag tttaggtgtc agcctggctt tgtcatgaaa
4981 ggaccccgcc gtgtgaagtg ccaggccctg aacaaatggg agccagagtt accaagctgc
5041 tccagggtgt gtcagccgcc tccagaaatc ctgcatggtg agcatacccc aagccatcag
5101 gacaactttt cacctgggca ggaagtgttc tacagctgtg agcctggcta tgacctcaga
5161 ggggctgcgt ctctgcactg cacaccccag ggagactgga gccctgaagc cccgagatgt
5221 gcagtgaaat cctgtgatga cttcttgggt caactccctc atggccgtgt gctatttcca
5281 cttaatctcc agcttggggc aaaggtgtcc tttgtctgtg atgaagggtt tcgcttaaag
5341 ggcagttccg ttagtcattg tgtcttggtt ggaatgagaa gcctttggaa taacagtgtt
5401 cctgtgtgtg aacatatctt ttgtccaaat cctccagcta tccttaatgg gagacacaca
5461 ggaactccct ctggagatat tccctatgga aaagaaatat cttacacatg tgacccccac
5521 ccagacagag ggatgacctt caacctcatt ggggagagca ccatccgctg cacaagtgac
5581 cctcatggga atggggtttg gagcagccct gcccctcgct gtgaactttc tgttcgtgct
5641 ggtcactgta aaaccccaga gcagtttcca tttgccagtc ctacgatccc aattaatgac
5701 tttgagtttc cagtcgggac atctttgaat tatgaatgcc gtcctgggta ttttgggaaa
5761 atgttctcta tctcctgcct agaaaacttg gtctggtcaa gtgttgaaga caactgtaga
5821 cgaaaatcat gtggacctcc accagaaccc ttcaatggaa tggtgcatat aaacacagat
5881 acacagtttg gatcaacagt taattattct tgtaatgaag ggtttcgact cattggttcc
5941 ccatctacta cttgtctcgt ctcaggcaat aatgtcacat gggataagaa ggcacctatt
6001 tgtgagatca tatcttgtga gccacctcca accatatcca atggagactt ctacagcaac
6061 aatagaacat cttttcacaa tggaacggtg gtaacttacc agtgccacac tggaccagat
6121 ggagaacagc tgtttgagct tgtgggagaa cggtcaatat attgcaccag caaagatgat
6181 caagttggtg tttggagcag ccctccccct cggtgtattt ctactaataa atgcacagct
6241 ccagaagttg aaaatgcaat tagagtacca ggaaacagga gtttcttttc cctcactgag
6301 atcgtcagat ttagatgtca gcccgggttt gtcatggtag ggtcccacac tgtgcagtgc
6361 cagaccaatg gcagatgggg gcccaagctg ccacactgct ccagggtgtg tcagccgcct
6421 ccagaaatcc tgcatggtga gcatacccta agccatcagg acaacttttc acctgggcag
6481 gaagtgttct acagctgtga gcccagctat gacctcagag gggctgcgtc tctgcactgc
6541 acgccccagg gagactggag ccctgaagcc cctagatgta cagtgaaatc ctgtgatgac
6601 ttcctgggcc aactccctca tggccgtgtg ctacttccac ttaatctcca gcttggggca
6661 aaggtgtcct ttgtttgcga tgaagggttc cgattaaaag gcaggtctgc tagtcattgt
6721 gtcttggctg gaatgaaagc cctttggaat agcagtgttc cagtgtgtga acaaatcttt
6781 tgtccaaatc ctccagctat ccttaatggg agacacacag gaactccctt tggagatatt
6841 ccctatggaa aagaaatatc ttacgcatgc gacacccacc cagacagagg gatgaccttc 
6901 aacctcattg gggagagctc catccgctgc acaagtgacc gtcaagggaa tggggtttgg 
6961 agcagccctg cccctcgctg tgaactttct gttcctgctg cctgcccaga tccacccaag 
7021 atccaaaacg ggcattacat tggaggacac gtatctctat atcttcctgg gatgacaatc
7081 agctacattt gtgaccccgg ctacctgtta gtgggaaagg gcttcatttt ctgtacagac 
7141 cagggaatct ggagccaatt ggatcattat tgcaaagaag taaattgtag cttcccactg 
7201 tttatgaatg gaatctcgaa ggagttagaa atgaaaaaag tatatcacta tggagattat 
7261 gtgactttga agtgtgaaga tgggtatact ctggaaggca gtccctggag ccagtgccag 

7321 gcggatgaca gatgggaccc tcctctggcc aaatgtacct ctcgtgcaca tgatgctctc
7381 atagttggca ctttatctgg tacgatcttc tttattttac tcatcatttt cctctcttgg 
7441 ataattctaa agcacagaaa aggcaataat gcacatgaaa accctaaaga agtggctatc 
7501 catttacatt ctcaaggagg cagcagcgtt catccccgaa ctctgcaaac aaatgaagaa 
7561 aatagcaggg tccttccttg acaaagtact atacagctga agaacatctc gaatacagtt 

7621 ttggtgggaa aggagccaat tgatttcaac agaatcagat ctgagcttca taaagtcttt
7681 gaagtgactt cacagagacg cagacatgtg cacttgaaga tgctgcccct tccctggtac 
7741 ctagcaaagc tcctgcctct ttgtgtgcgt cactgtgaaa cccccaccct tctgcctcgt 
7801 gctaaacgca cacagtatct agtcagggga aaagactgca tttaggagat agaaaatagt 
7861 ttggattact taaaggaata aggtgttgcc tggaatttct ggtttgtaag gtggtcactg

7921 ttctttttta aaatatttgt aatatggaat gggctcagta agaagagctt ggaaaatgca
7981 gaaagttatg aaaaataagt cacttataat tatgctacct actgataacc actcctaata 
8041 ttttgattca ttttctgcct atcttctttc acatatgtgt ttttttacat acgtcctttt
8101 ccccccagtt tgtttccttt tattttatag agcagaaccc tagtctttta aaccagttta 
8161 gagtgaaata tatgctatat cagtttttac tttctctagg gagaaaaatt aatttactag 

8221 aaaggcatga aatgatcatg ggaagagtgg ttaagactac tgaagagaaa tatttggaaa
8281 ataagatttc gatatcttct ttttttttga gatggagtct ggctctgtct cccaggctgg 
8341 agtgcagtgg cgtaatctcg gctcactgca agctccgcct cctgggttga caccattttc 
8401 ctgcctcagc ctcctgagta gttgggacta ccagtagatg ggactacagg cacctgccaa 
8461 cacgcccggc taattttttt gtatttttag tagagacggg gtttcaccat gttagccagg 

8521 atggtctgga tctcctgacc tcgtgatcca cccgcctcgg cctcccaaag tgctgcgatt
8581 acaggcatga gccaccgcgc ctggccgctt tcgatatttt ctaaacttta attcaaaagc 
8641 actttgtgct gtgttctata taaaaaacat aataaaaatt gaaatgaaag aataattgtt 
8701 attataaaag tactagctta cttttgtatg gattcagaat atactaaatt aactttttaa 
8761 aacacaactt ttaaaaaatg tatcaaaaat aataaacgtg ttctgatatt ttta 



SEQ ID NO: 53

Amino acid sequence of human CR1 encoded by the DNA sequence shown in SEQ ID NO: 52.

MGASSPRSPEPVGPPAPGLPFCCGGSLLAVWLLAL^ 

FEFPIGTYLNYECRPGYSGRPFSIICLKNSVWTGAKDRCRRKSCRNPPDPVNGMVHVIKG
IQFGSQIKySCTKGYRLIGSSSATCIISGDTVIWDNETPICDRIPCGLPPTITNGDFIST 
NRENFHYGSWTYRCNPGSGGRKVFELVGEPSIYCTSNDDQVGIWSGPAPQCI I PNKCTP 
PNVENGILVSDNRSLFSLNEWEFRCQPGFVMKGPRRVKCQALNKWEPELPSCSRVCQPP 
PDVLHAERTQRDKDMFSPGQEVFYSCEPGYDLRGAASMRCTPQGDWSPAAPTCEVKSCDD 

FMGQLLNGRVLFPVNLQLGAKVDFVCDEGFQLKGSSASYCVLAGMESLWNSSVPVCEQIF
CPSPPVIPNGRHTGKPLEVFPFGKAVNYTCDPHPDRGTSFDLIGESTIRCTSDPQGNGVW 
SSPAPRCGILGHCQAPDHFLFAKLKTQTNASDFPIGTSLKYECRPEYYGRPFSITCLDNL 
VWSSPKDVCKRKSCKTPPDPVNGMVHVITDIQVGSRINYSCTTGHRLIGHSSAECILSGN 
AAHWSTKPPIC^RIPCGLPPTIANGDFISTNRENFHYGSWTYRCNPGSGGRKVFELVGE 

PSIYCTSNDDQVGIWSGPAPQCIIPNKCTPPNVENGILVSDNRSLFSLNEVVEFRCQPGF
VMKGPRRVKCQALNKWEPELPSCSRVCQPPPDVLHAERTQRDKDNFSPGQEVFYSCEPGY 
DLRGAAS MRCTPQGD WS PAAPTCEVKS CDDFMGOLLNGRVLFPVNLQLGAKVDFVCDEGF 
QLKGSSASYO/LAGMESLWNSSVPVCEQIFCPSPPVIPNGRHTGKPLEVFPTOKTVNYTC 
DPHPDRGTS PDL IGE ST IRCTSDPQGNGVWS S PAPRCG I LGHCQAPDH FL F AKLKTQTNA 

SDFPIGTSLKYECRPEYYGRPFSITCLDNLVWSSPKDVCKRKSCKTPPDPVNGMVHVITD
IQVGSRINYSCTTGHRLIGHSSAECILSGNAAHWSTKPPICQRIPCGLPPTIANGDFIST 
NRENFHYGSWTYRCNPGSGGRKVFELVGEPS I YCTSNDDQVGI WSGPAPQCI I PNKCTP 
PNVENGILVSDNRSIiFSLNEVVEFRCQPGPWKGPRRVKCQALNKWEPBLPSCSRVCQPP
PDV14IAERTQRDKDNFSP<%EVPYSCEPGYDLRGAASMRCTPQGDWSPAAPTCEVKSCDD 
FMGQLLNGR VLF PVNLQLGAKVDF VCDEGFQLKGS S AS YCVLAGMBS LWNSS VP VCEQI F 
CPSPPVIPNGRHTGKPLEVFPFGKAVNYTCDPHPDRGTSFDLIGESTIRCTSDPQGNGVW 
SSPAPRCGILGHCQAPDHFLFAKLKTQTNASDFPIGTSLKYECRPEYYGRPFSITCLDNL
VWSSPKDVCKRKSCKTPPDPVNGKVHVITDIQVGSRINYSCTTGHRLIGHSSAECILSGN 
TAHWSTKPPICQRIPCGLPPTIANGDFISTNRENFHYGSWTYRCNIiGSRGRKVFELVGE 
PSI YCTSNDDQVGI WSGPAPQCI I PNKCTPPNVENGILVSDNRSLFSLNE WEFRCQPGF 
VMKGPRRVKCQALNKWEPELPSCSRVCQPPPEILHGEHTPSHQDNFSPGQEVFYSCEPGY 

DLRGAASLHCTPQGDWSPEAPRCAVKSCDDFLGQLPHGRVLFPLNLQLGAKVSFVCDEGF
RLKGSSVSHCVLVGMRSLWNNSVPVCEHIFCPNPPAILNGRHTGTPSGDIPYGKEISYTC 
DPHPDRGMTFNIiIGBSTIRCTSDPHGNGVWSSPAPRCELSVRAGHCKTPEQFPFASPTIP 
IOTFEFPVGTSLNYECRPGYFGKMFSISCLENLVWSSVEDNORRKSCGPPPEPFNGMVHI 
NTin^FGSTVNYSCNEGFRLIGSPSTTCLVSG 

YSNNRTSFHNGTVVTYQCHTGPDGEQLFELVGERSIYCTSKDDQVGVWSSPPPRCISTNK
CTAPEVENAIRVPGNRSFFSLTEIVRFRCQPGFVMVGSHTVQCQTNGRWGPKLPHCSRVC 
QPPPEILHGEHTLSHQDNFSPGQEVFYSCEPSYDLRGAASLHCTPQGDWSPEAPRCTVKS 
CDDFLGQLPHGRVLLPLNIiQLGAKVS FVCDEGFRLKGRSASHCVLAGMKALWNS S VPVCE 
QIFCPNPPAILNGRHTGTPFGDIPYGKEISYACDTHPDRGMTFNLIGESSIRCTSDRQGN 

GVWSSPAPRCELSVPAACPDPPKIQNGHYIGGHVSLYLPGMTISYICDPGYLLVGKGFIF
CTDQGIWSQLDHYCKEVNCSFPLFMNGISKELEMKKVYHYGDYVTLKCEDGYTLEGSPWS 
QCQADDRWDPPLAKCTSRAHDAIjI VGTLSGTI FFI LLI I FLS WI ILKHRKGNNAHENPKE 
VAIHLHSQGGSSVHPRTLQTNEENSRVLP 

SEQ ID NO: 54

gi|18490996|ref|NM_000573.2| Homo sapiens complement component (3b/4b) receptor 1,
including Knops blood group system (CR1), transcript variant F, mRNA

1 acactctggg cgcggagcac aatgattggt cactcctatt ttcgctgagc ttttcctctt 
61 atttcagttt tcctcgagat caaatctggt ttgtagatgt gcttggggag aatgggggcc 
121 tcttctccaa gaagcccgga gcctgtcggg ccgccggcgc ccggtctccc cttctgctgc 
181 ggaggatccc tgctggcggt tgtggtgctg cttgcgctgc cggtggcctg gggtcaatgc
241 aatgccccag aatggcttcc atttgccagg cctaccaacc taactgatga gtttgagttt 
301 cccattggga catatctgaa ctatgaatgc cgccctggtt attccggaag accgttttct 
361 atcatctgcc taaaaaactc agtctggact ggtgctaagg acaggtgcag acgtaaatca 
421 tgtcgtaatc ctccagatcc tgtgaatggc atggtgcatg tgatcaaagg catccagttc 
481 ggatcccaaa ttaaatattc ttgtactaaa ggataccgac tcattggttc ctcgtctgcc
541 acatgcatca tctcaggtga tactgtcatt tgggataatg aaacacctat ttgtgacaga 
601 attccttgtg ggctaccccc caccatcacc aatggagatt tcattagcac caacagagag 
661 aattttcact atggatcagt ggtgacctac cgctgcaatc ctggaagcgg agggagaaag 
721 gtgtttgagc ttgtgggtga gccctccata tactgcacca gcaatgacga tcaagtgggc 
781 atctggagcg gccccgcccc tcagtgcatt atacctaaca aatgcacgcc tccaaatgtg
841 gaaaatggaa tattggtatc tgacaacaga agcttatttt ccttaaatga agttgtggag 
901 tttaggtgtc agcctggctt tgtcatgaaa ggaccccgcc gtgtgaagtg ccaggccctg 
961 aacaaatggg agccggagct accaagctgc tccagggtat gtcagccacc tccagatgtc 
1021 ctgcatgctg agcgtaccca aagggacaag gacaactttt cacctgggca ggaagtgttc 
1081 tacagctgtg agcccggcta cgacctcaga ggggctgcgt ctatgcgctg cacaccccag
1141 ggagactgga gccctgcagc ccccacatgt gaagtgaaat cctgtgatga cttcatgggc 
1201 caacttctta atggccgtgt gctatttcca gtaaatctcc agcttggagc aaaagtggat 
1261 tttgtttgtg atgaaggatt tcaattaaaa ggcagctctg ctagttactg tgtcttggct 
1321 ggaatggaaa gcctttggaa tagcagtgtt ccagtgtgtg aacaaatctt ttgtccaagt 
1381 cctccagtta ttcctaatgg gagacacaca ggaaaacctc tggaagtctt tccctttgga
1441 aaagcagtaa attacacatg cgacccccac ccagacagag ggacgagctt cgacctcatt 
1501 ggagagagca ccatccgctg cacaagtgac cctcaaggga atggggtttg gagcagccct 
1561 gcccctcgct gtggaattct gggtcactgt caagccccag atcattttct gtttgccaag 
1621 ttgaaaaccc aaaccaatgc atctgacttt cccattggga catctttaaa gtacgaatgc 
1681 cgtcctgagt actacgggag gccattctct atcacatgtc tagataacct ggtctggtca
1741 agtcccaaag atgtctgtaa acgtaaatca tgtaaaactc ctccagatcc agtgaatggc 
1801 atggtgcatg tgatcacaga catccaggtt ggatccagaa tcaactattc ttgtactaca 
1861 gggcaccgac tcattggtca ctcatctgct gaatgtatcc tctcgggcaa tgctgcccat
1921 tggagcacga agccgccaat ttgtcaacga attccttgtg ggctaccccc caccatcgcc 
1981 aatggagatt tcattagcac caacagagag aattttcact atggatcagt ggtgacctac 
2041 cgctgcaatc ctggaagcgg agggagaaag gtgtttgagc ttgtgggtga gccctccata 
2101 tactgcacca gcaatgacga tcaagtgggc atctggagcg gcccggcccc tcagtgcatt
2161 atacctaaca aatgcacgcc tccaaatgtg gaaaatggaa tattggtatc tgacaacaga 
2221 agcttatttt ccttaaatga agttgtggag tttaggtgtc agcctggctt tgtcatgaaa 
2281 ggaccccgcc gtgtgaagtg ccaggccctg aacaaatggg agccggagct accaagctgc 
2341 tccagggtat gtcagccacc tccagatgtc ctgcatgctg agcgtaccca aagggacaag 
2401 gacaactttt cacccgggca ggaagtgttc tacagctgtg agcccggcta tgacctcaga
2461 ggggctgcgt ctatgcgctg cacaccccag ggagactgga gccctgcagc ccccacatgt 
2521 gaagtgaaat cctgtgatga cttcatgggc caacttctta atggccgtgt gctatttcca 
2581 gtaaatctcc agcttggagc aaaagtggat tttgtttgtg atgaaggatt tcaattaaaa 
2641 ggcagctctg ctagttattg tgtcttggct ggaatggaaa gcctttggaa tagcagtgtt 
2701 ccagtgtgtg aacaaatctt ttgtccaagt cctccagtta ttcctaatgg gagacacaca
2761 ggaaaacctc tggaagtctt tccctttgga aaagcagtaa attacacatg cgacccccac 
2821 ccagacagag ggacgagctt cgacctcatt ggagagagca ccatccgctg cacaagtgac 
2881 cctcaaggga atggggtttg gagcagccct gcccctcgct gtggaattct gggtcactgt 
2941 caagccccag atcattttct gtttgccaag ttgaaaaccc aaaccaatgc atctgacttt 

3001 cccattggga catctttaaa gtacgaatgc cgtcctgagt actacgggag gccattctct
3061 atcacatgtc tagataacct ggtctggtca agtcccaaag atgtctgtaa acgtaaatca 
3121 tgtaaaactc ctccagatcc agtgaatggc atggtgcatg tgatcacaga catccaggtt 
3181 ggatccagaa tcaactattc ttgtactaca gggcaccgac tcattggtca ctcatctgct 
3241 gaatgtatcc tctcaggcaa tactgcccat tggagcacga agccgccaat ttgtcaacga 

3301 attccttgtg ggctaccccc aaccatcgcc aatggagatt tcattagcac caacagagag
3361 aattttcact atggatcagt ggtgacctac cgctgcaatc ttggaagcag agggagaaag 
3421 gtgtttgagc ttgtgggtga gccctccata tactgcacca gcaatgacga tcaagtgggc 
3481 atctggagcg gccccgcccc tcagtgcatt atacctaaca aatgcacgcc tccaaatgtg
3541 gaaaatggaa tattggtatc tgacaacaga agcttatttt ccttaaatga agttgtggag 

3601 tttaggtgtc agcctggctt tgtcatgaaa ggaccccgcc gtgtgaagtg ccaggccctg
3661 aacaaatggg agccagagtt accaagctgc tccagggtgt gtcagccgcc tccagaaatc 
3721 ctgcatggtg agcatacccc aagccatcag gacaactttt cacctgggca ggaagtgttc 
3781 tacagctgtg agcctggcta tgacctcaga ggggctgcgt ctctgcactg cacaccccag 
3841 ggagactgga gccctgaagc cccgagatgt gcagtgaaat cctgtgatga cttcttgggt 

3901 caactccctc atggccgtgt gctatttcca cttaatctcc agcttggggc aaaggtgtcc
3961 tttgtctgtg atgaagggtt tcgcttaaag ggcagttccg ttagtcattg tgtcttggtt 
4021 ggaatgagaa gcctttggaa taacagtgtt cctgtgtgtg aacatatctt ttgtccaaat 
4081 cctccagcta tccttaatgg gagacacaca ggaactccct ctggagatat tccctatgga 
4141 aaagaaatat cttacacatg tgacccccac ccagacagag ggatgacctt caacctcatt 

4201 ggggagagca ccatccgctg cacaagtgac cctcatggga atggggtttg gagcagccct
4261 gcccctcgct gtgaactttc tgttcgtgct ggtcactgta aaaccccaga gcagtttcca 
4321 tttgccagtc ctacgatccc aattaatgac tttgagtttc cagtcgggac atctttgaat 
4381 tatgaatgcc gtcctgggta ttttgggaaa atgttctcta tctcctgcct agaaaacttg 
4441 gtctggtcaa gtgttgaaga caactgtaga cgaaaatcat gtggacctcc accagaaccc 

4501 ttcaatggaa tggtgcatat aaacacagat acacagtttg gatcaacagt taattattct
4561 tgtaatgaag ggtttcgact cattggttcc ccatctacta cttgtctcgt ctcaggcaat 
4621 aatgtcacat gggataagaa ggcacctatt tgtgagatca tatcttgtga gccacctcca 
4681 accatatcca atggagactt ctacagcaac aatagaacat cttttcacaa tggaacggtg 
4741 gtaacttacc agtgccacac tggaccagat ggagaacagc tgtttgagct tgtgggagaa 

4801 cggtcaatat attgcaccag caaagatgat caagttggtg tttggagcag ccctccccct
4861 cggtgtattt ctactaataa atgcacagct ccagaagttg aaaatgcaat tagagtacca 
4921 ggaaacagga gtttcttttc cctcactgag atcatcagat ttagatgtca gcccgggttt 
4981 gtcatggtag ggtcccacac tgtgcagtgc cagaccaatg gcagatgggg gcccaagctg 
5041 ccacactgct ccagggtgtg tcagccgcct ccagaaatcc tgcatggtga gcatacccta 

5101 agccatcagg acaacttttc acctgggcag gaagtgttct acagctgtga gcccagctat
5161 gacctcagag gggctgcgtc tctgcactgc acgccccagg gagactggag ccctgaagcc 
5221 cctagatgta cagtgaaatc ctgtgatgac ttcctgggcc aactccctca tggccgtgtg 
5281 ctacttccac ttaatctcca gcttggggca aaggtgtcct ttgtttgcga tgaagggttc 
5341 cgattaaaag gcaggtctgc tagtcattgt gtcttggctg gaatgaaagc cctttggaat 

5401 agcagtgttc cagtgtgtga acaaatcttt tgtccaaatc ctccagctat ccttaatggg
5461 agacacacag gaactccctt tggagatatt ccctatggaa aagaaatatc ttacgcatgc 
5521 gacacccacc cagacagagg gatgaccttc aacctcattg gggagagctc catccgccgc
5581 acaagtgacc ctcaagggaa tggggtttgg agcagccctg cccctcgctg tgaactttct 
5641 gttcctgctg cctgcccaca tccacccaag atccaaaacg ggcattacat tggaggacac 
5701 gtatctctat atcttcctgg gatgacaatc agctacactt gtgaccccgg ctacctgtta 
5761 gtgggaaagg gcttcatttt ctgtacagac cagggaatct ggagccaatt ggatcattat
5821 tgcaaagaag taaattgtag cttcccactg tttatgaatg gaatctcgaa ggagttagaa 
5881 atgaaaaaag tatatcacta tggagattat gtgactttga agtgtgaaga tgggtatact 
5941 ctggaaggca gtccctggag ccagtgccag gcggatgaca gatgggaccc tcctctggcc 
6001 aaatgtacct ctcgtgcaca tgatgctctc atagttggca ctttatctgg tacgatcttc 

6061 tttattttac tcatcatttt cctctcttgg ataattctaa agcacagaaa aggcaataat
6121 gcacatgaaa accctaaaga agtggctatc catttacatt ctcaaggagg cagcagcgtt 
6181 catccccgaa ctctgcaaac aaatgaagaa aatagcaggg tccttccttg acaaagtact 
6241 atacagctga agaacatctc gaatacaatt ttggtgggaa aggagccaat tgatttcaac 
6301 agaatcagat ctgagcttca taaagtcttt gaagtgactt cacagagacg cagacatgtg 

6361 cacttgaaga tgctgcccct tccctggtac ctagcaaagc tcctgcctct ttgtgtgcgt
6421 cactgtgaaa cccccaccct tctgcctcgt gctaaacgca cacagtatct agtcagggga 
6481 aaagactgca tttaggagat agaaaatagt ttggattact taaaggaata aggtgttgcc 
6541 tggaatttct ggtttgtaag gtggtcactg ttctttttta aaatatttgt aatatggaat 
6601 gggctcagta agaagagctt ggaaaatgca gaaagttatg aaaaataagt cacttataat 

6661 tatgctacct actgataacc actcctaata ttttgattca ttttctgcct atcttctttc
6721 acatatgtgt ttttttacat acgtactttt ccccccttag tttgtttcct tttattttat 
6781 agagcagaac cctagtcttt taaacagttt agagtgaaat atatgctata tcagttttta 
6841 ctttctctag ggagaaaaat taatttacta gaaaggcatg aaatgatcat gggaagagtg 
6901 gttaagacta ctgaagagaa atatttggaa aataagattt cgatatcttc tttttttttg 

6961 agatggagtc tggctctgtc tcccaggctg gagtgcagtg gcgtaatctc ggctcactgc
7021 aacgtccgcc tcccgggttg acaccatttt cctgcctcag cctcctgagt agttgggact 
7081 accagtagat gggactacag gcacctgcca acacgcccgg ctaatttttt tgtattttta 
7141 gtagagacgg ggtttcacca tgttagccag gatggtctgg atctcctgac ctcgtgatcc 
7201 acccgcctcg gcctcccaaa gtgctgcgat tacaggcatg agccaccgcg cctggccgct 

7261 ttcgatattt tctaaacttt aattcaaaag cactttgtgc tgtgttctat ataaaaaaca
7321 taataaaaat tgaaatgaaa gaataattgt tattataaaa gtactagctt acttttgtat 
7381 ggattcagaa tatactaaat taacttttta aaacacaact tttaaaaaat gtatcaaaaa 
7441 taataaacgt gttctgatat tttta 

SEQ ID NO: 55 

Amino acid sequence of human CR1 variant ORF number 1 encoded by the DNA sequence
shown in SEQ ID NO: 54. 

MGASSPRSPEPVGPPAPGLPFCCGGSLIAVVVLU^ 

FEPP IGTYLNYECRPG YSGRPFS 1 1 CLKNSVWTGAKDRCRRKSCRNPPDPVNGMVHVI KG 
IQPGSQIKYSCTKGYRLIGSSSATCIISGDTVIWDNBTPICDRIPCGLPPTITNGDFIST 

NRENFHYGSVVTYRCNPGSGGRKVFELVGEPSIYCTSNDDQVGIWSGPAPQCIIPNKCTP
PNVENGILVSDNRSLFSI-NEWEPRCQPGFVMKGPRRVKCQALNKWEPBIiPSCSRVCQPP 
PD VLHAERTQRDKDN FS PGQEVF YSCEPG YDLRGAAS MRCTPQGD WS PAAPTCE VKSCDD 
FMGQLLNGRVLPPVNLQLGAKVDFVCDEGFQLKGSSASYCVLAGMESLWNSSVPVCEQIF 
CPSPPVIPNGRHTGKPLEVFPFGKAVNYTCDPHPDRGTSFDLIGESTIRCTSDPQGNGVW 

SSPAPRCGILGHCQAPDHFLFAKLKTQTNASDFPIGTSLKYECRPEYYGRPFSITCLDNL
WSSPKDVCKRKSCKTPPDPVNGMVHVITDIQVGSRIKYSCTTGHRLIGHSSAECILSGN 
AAHWSTKPP I CQRI P CGL PPTIANGDF I STNRENFHYGSVVTYRCNPG SGGRKVFEL VGE 
PS I YCTSNDDQVG I W SG PAPQC 1 1 PNKCTPPNVENGI LV SDNRS LFSLNEWE FRCQ PG F 
VMKX3PRRVKCQALNKWEPBLP8C8RVC0PPPDVLHABRTQRDKZ>NFSPGQEVFYSCEPGY 

DLRGAASMRCTPQGDWSPAAPTCEVKSCDDFMGQLLNGRVLFPVNLQLGAKVDFVCDEGF
QLKGSSASYCVLAGMESLWNSSVFVCEQI FCPSPPVI PNGRHTGKPJL»EVFPFGKAVNYTC 
DPHPDRGTSFDLIGESTIRCTSDPQGNGWSSPAPRCGaLGHCQAPDHFLFAKLKTQTNA 
SDFPIGTSLKYECRPEYYGRPFSITCLDNLVWSSPKDVCKRKSCKTPPDPVNGMVHVITD 
IQVGSRINYSCTTGHRLIGHSSAECILSGNTAHWSTKPPICQRIPCGLPPTIANGDFIST 

NRENFHYGSVVTYRCNLGSRGRKVFELVGEPSIYCTSNDDQVGIWSGPAPQCIIPNKCTP
PNVENGILVSDNRSLFSLNEVVEFRCQPGFVMKGPRRVKCQALNKWBPEIjPSCSRVCQPP 

PEILHGEHTPSHQDNFSPGQEVFYSCEPGYDLRGAASLHCTPQGDWSPEAPRCAVKSCDD
FLGQLPHGRVLFPLNLQLGAKVSFVCDEGFRLKGSSVSHCVLVGMRSLWNNSVPVCEHIF
CPNPPAILNGRHTGTPSGDIPYGKEISYTCDPHPDRGMTFNLIGESTIRCTSDPHONGVW 
SSPAPRCBLSVRAGHCKTPEQFPFASPTIPINDFBFPVGTSLNYECRPGYFGKMFSISCL 
BNLVWSSVEDNCRRKSCGPPPEPFNGMVHIimJTQFGSTVNYSCNEGFRLIGSPSTTCLV 
SGNNVTWDKKAPICEIISCEPPPTISNGDFYSNNRT^^

VGERSIYCTSKDDQVGVWSSPPPRCISTNKCTAPEVENAIRVPGNRSFFSLTEIIRFRCQ 
PGFVMVGSHTVQCQTNGRWGPKLPHCSRVCQPPPEILHGEHTLSHQDNFSPGQEVPYSCE 
PSYDLRGAASLHCTPQGDWSPEAPRCTVKSCDDFLGQLPHGRVLLPLNLQLGAKVSFVCD 
EGFRLKGRSASHCVliAGMKALWNSSVPVCEQIFCPNPPAILNGRHTGTPFGDIPYGKEIS 
YACDTHPDRGMTFNLIGESSIRCTSDPQGNGVWSSPAPRCELSVPAACPHPPKIQNGHYI
GGHVSL YL PGMT I S YTCDPG YLLVGKG F I FCTDQG I WSQLDHYCKEVNCS FPLFMNG I S K 
BLEMKKVYHYGDYWLKCEDGYTLEGSPWSQCQADDRWDPPLAKCTSRAKDALIVGTLSG 
TIFFILLIIFLSWIILKHRKGNNAHENPKEVAIHLHSQGGSSVHPRTLQTNEENSRVIiP 

SEQ ID NO: 56

gi|27262658|ref|NM_005211.2| Homo sapiens colony stimulating factor 1 receptor, formerly
McDonough feline sarcoma viral (v-fms) oncogene homolog (CSF1R), mRNA

1 gaagggcaga cagagtgtcc aaaagcgtga gagcacgaag tgaggagaag gtggagaaga 
61 gagaagagga agaggaagag gaagagagga agcggaggga actgcggcca ggctaaaagg 
121 ggaagaagag gatcagccca aggaggagga agaggaaaac aagacaaaca gccagtgcag 
181 aggagaggaa cgtgtgtcca gtgtcccgat ccctgcggag ctagtagctg agagctctgt
241 gccctgggca ccttgcagcc ctgcacctgc ctgccacttc cccaccgagg ccatgggccc 
301 aggagttctg ctgctcctgc tggtggccac agcttggcat ggtcagggaa tcccagtgat 
361 agagcccagt gtccctgagc tggtcgtgaa gccaggagca acggtgacct tgcgatgtgt 
421 gggcaatggc agcgtggaat gggatggccc cccatcacct cactggaccc tgtactctga 
481 tggctccagc agcatcctca gcaccaacaa cgctaccttc caaaacacgg ggacctatcg
541 ctgcactgag cctggagacc ccctgggagg cagcgccgcc atccacctct atgtcaaaga 
601 ccctgcccgg ccctggaacg tgctagcaca ggaggtggtc gtgttcgagg accaggacgc 
661 actactgccc tgtctgctca cagacccggt gctggaagca ggcgtctcgc tggtgcgtgt 
721 gcgtggccgg cccctcatgc gccacaccaa ctactccttc tcgccctggc atggcttcac 
781 catccacagg gccaagttca ttcagagcca ggactatcaa tgcagtgccc tgatgggtgg
841 caggaaggtg atgtccatca gcatccggct gaaagtgcag aaagtcatcc cagggccccc 
901 agccttgaca ctggtgcctg cagagctggt gcggattcga ggggaggctg cccagatcgt
961 gtgctcagcc agcagcgttg atgtcaactt tgatgtcttc ctccaacaca acaacaccaa
1021 gctcgcaatc cctcaacaat ctgactttca taataaccgt taccaaaaag tcctgaccct 
1081 caacctcgat caagtagatt tccaacatgc cggcaactac tcctgcgtgg ccagcaacgt
1141 gcagggcaag cactccacct ccatgttctt ccgggtggta gagagtgcct acttgaactt 
1201 gagctctgag cagaacctca tccaggaggt gaccgtgggg gaggggctca acctcaaagt 
1261 catggtggag gcctacccag gcctgcaagg ttttaactgg acctacctgg gacccttttc 
1321 tgaccaccag cctgagccca agcttgctaa tgctaccacc aaggacacat acaggcacac 
1381 cttcaccctc tctctgcccc gcctgaagcc ctctgaggct ggccgctact ccttcctggc
1441 cagaaaccca ggaggctgga gagctctgac gtttgagctc acccttcgat accccccaga 
1501 ggtaagcgtc atatggacat tcatcaacgg ctctggcacc cttttgtgtg ctgcctctgg 
1561 gtacccccag cccaacgtga catggctgca gtgcagtggc cacactgata ggtgtgatga 
1621 ggcccaagtg ctgcaggtct gggatgaccc ataccctgag gtcctgagcc aggagccctt 
1681 ccacaaggtg acggtgcaga gcctgctgac tgttgagacc ttagagcaca accaaaccta
1741 cgagtgcagg gcccacaaca gcgtggggag tggctcctgg gccttcatac ccatctctgc 
1801 aggagcccac acgcatcccc cggatgagtt cctcttcaca ccagtggtgg tcgcctgcat 
1861 gtccatcatg gccttgctgc tgctgctgct cctgctgcta ttgtacaagt ataagcagaa 
1921 gcccaagtac caggtccgct ggaagatcat cgagagctat gagggcaaca gttatacttt 
1981 catcgacccc acgcagctgc cttacaacga gaagtgggag ttcccccgga acaacctgca
2041 gtttggtaag accctcggag ctggagcctt tgggaaggtg gtggaggcca cggcctttgg
2101 tctgggcaag gaggatgctg tcctgaaggt ggctgtgaag atgctgaagt ccacggccca 
2161 tgctgatgag aaggaggccc tcatgtccga gctgaagatc atgagccacc tgggccagca 
2221 cgagaacatc gtcaaccttc tgggagcctg tacccatgga ggccctgtac tggtcatcac 
2281 ggagtactgt tgctatggcg acctgctcaa ctttctgcga aggaaggctg aggccatgct
2341 gggacccagc ctgagccccg gccaggaccc cgagggaggc gtcgactata agaacatcca 
2401 cctcgagaag aaatatgtcc gcagggacag tggcttctcc agccagggtg tggacaccta
2461 tgtggagatg aggcctgtct ccacttcttc aaatgactcc ttctctgagc aagacctgga
2521 caaggaggat ggacggcccc tggagctccg ggacctgctt cacttctcca gccaagtagc 
2581 ccagggcatg gccttcctcg cttccaagaa ttgcatccac cgggacgtgg cagcgcgtaa 
2641 cgtgctgttg accaatggtc atgtggccaa gattggggac ttcgggctgg ctagggacat 
2701 catgaatgac tccaactaca ttgtcaaggg caatgcccgc ctgcctgtga agtggatggc
2761 cccagagagc atctttgact gtgtctacac ggttcagagc gacgtctggt cctatggcat 
2821 cctcctctgg gagatcttct cacttgggct gaatccctac cctggcatcc tggtgaacag 
2881 caagttctat aaactggtga aggatggata ccaaatggcc cagcctgcat ttgccccaaa 
2941 gaatatatac agcatcatgc aggcctgctg ggccttggag cccacccaca gacccacctt 

3001 ccagcagatc tgctccttcc ttcaggagca ggcccaagag gacaggagag agcgggacta
3061 taccaatctg ccgagcagca gcagaagcgg tggcagcggc agcagcagca gtgagctgga 
3121 ggaggagagc tctagtgagc acctgacctg ctgcgagcaa ggggatatcg cccagccctt 
3181 gctgcagccc aacaactatc agttctgctg aggagttgac gacagggagt accactctcc 
3241 cctcctccaa acttcaactc ctccatggat ggggcgacac ggggagaaca tacaaactct 

3301 gccttcggtc atttcactca acagctcggc ccagctctga aacttgggaa ggtgagggat
3361 tcaggggagg tcagaggatc ccacttcctg agcatgggcc atcactgcca gtcaggggct 
3421 gggggctgag ccctcacccc cccctcccct actgttctca tggtgttggc ctcgtgtttg 
3481 ctatgccaac tagtagaacc ttctttccta atccccttat cttcatggaa atggactgac 
3541 tttatgccta tgaagtcccc aggagctaca ctgatactga gaaaaccagg ctctfctgggg 

3601 ctagacagac tggcagagag tgagatctcc ctctctgaga ggagcagcag atgctcacag
3661 accacactca gctcaggccc cttggagcag gatggctcct ctaagaatct cacaggacct 
3721 cttagtctct gccctatacg ccgccttcac tccacagcct cacccctccc acccccatac 
3781 tggtactgct gtaatgagcc aagtggcagc taaaagttgg gggtgttctg cccagtcccg 
3841 tcattctggg ctagaaggca ggggaccttg gcatgtggct ggccacacca agcaggaagc 

3901 acaaactccc ccaagctgac tcatcctaac taacagtcac gccgtgggat gtctctgtcc
3961 acattaaact aacagcatta atgca 

SEQ ID NO: 57

Amino acid sequence of human CSF1R encoded by the DNA sequence shown in SEQ ID
NO: 56.

MOPOVLLLLLVATAWHQQGX PVI EPS VPEl* WKPQATVTLRCVGNQS VHWDGPPS PHWTL
YSDGSSSILSTIWATFQNTGTYRCTEPGDPLGGSAAIHLWOT 

QDALL PC LLTD P VLEAG VSLVR VRGR PLMRHTNYS FS P WHO FTI HRAKF I QSQDYQCS AL 
MGGRKVMS I S I RLKVQKVI PG PPALTLVPAELVR IRGEAAQ t VCS AS S VDVNFDVFLQHN 
NTXLAIPQQSDFKNNRYQKVliTLNLDQVDFQHAGl^ 

LNLSSEQm*IQEVTVGEGIiNLKV^1VEAYPGLQGFNWTYLGPFSDH0PEPKLWATTKDTY
RHTFTLSLPRLKPSEAGRYSFLARNPGGWWVLTPELTLRYPPEVSVIWTFINGSGTLL^ 
ASGYPQPNVTWLQCSGHTDRCDEAQVLQVWDDPYPEVLSQEPFHKVTVQSLLl^TLEHN 
QTYECRAHNS VGSGS WAF I P I SAGAHTHPPDEFLFTPVWACMS IMALLLLLLLLLLYKY 
KQKPKYQVRWKI I ES YEGNS YTFIDPTQLPYITCKWEFPRNNIjQFGKTLGAGAFGKVVEAT 

AFGLGKEDAVLKVAVKMLKSTAHADBKEALMSELKIMSHl^
VXTEYCCYGDLLNFLRRKAEAMLGPSLSPGQDPEGGVDYKN^ 

DTYVENRPVSTS8N0SFSEQDZ«DKEDGRPLELRDIiIiHFS8QVAQGMAFIJVSKNCIURDVA 
ARNVLLTNGHVAKIGDFGLARDIMNDS W I VKGNARLPVKWMA I FDCVYTVQSDVWS 
YGILLWEIFSLGLNPYPGILVNSKFYKL\nO}GYQMAQPAFAPKNIYSXMQACWALBPTHR 
PTFQQICSFXiQEQAQEDRRERDYTNLPSSSRSGGSGSSSSELEEBSSSEHLTCCEQGDXA
QPLLQPNNYQFC 

SEQ ID NO: 58

gi|6681044|ref|NM_007779.1| Mus musculus colony stimulating factor 1 receptor (Csf1r),
mRNA

1 cagaactagc agctgggagc cccgtgccca gccgactctc caacctgcat cggctcacge

61 tatcccctgg aggctatgga gttggggcct cctctggtcc tgctgctggc cacagtttgg 
121 catggtcagg gggcccctgt catcgagcct agtggcccag aactggttgt agagccgggt
181 gaaacggtga ccctgcgatg tgtgagcaat ggcagtgtgg aatgggatgg ccccatctct
241 cccatctgga ccttggaccc tgaatctccc ggaagcaccc tgaccacaag caacgcgacc 
301 ttcaaaaaca ctgggaccta ccgttgtacc gagcttgaag accccatggc aggcagtacc 
361 accatccact tgtatgtcaa agatccggcc cactcttgga atttgctggc acaggaggtg 
421 acagtggttg agggccagga agctgtgctg ccctgcctga tcactgaccc tgcactgaag
481 gacagtgtct cactgatgqa tgaggggggc aggcaggtct tacgcaaaac ggtctacttc 
541 ttctcgccat ggcgaggttc gattatccgc aaggctaaag tccttgacag caatacctac 
601 gtgtgcaaga ccatggtgaa tggtagggaa tccacctcca ctggcatctg gcttaaggtg 
661 aatcgagtcc acccagagcc cccacagata aaattggagc ctagcaagct ggtgcggatt 
721 cgaggggagg ctgcgcagat cgtgtgctcg gccactaacg ccgaagtggg attcaacgtt
781 atcctcaaac gtggagacac caagctggaa atccccctaa acagtgactt ccaagataac 
841 tattataaaa aagtccgggc tctcagtctc aacgctgtgg acttccaaga cgctggcata
901 tattcttgtg tggccagcaa tgatgttggc acacgcacgg ccaccatgaa cttccaggtg 
961 gtggagagtg cctacttaaa cttgacctct gagcagagcc tcttgcagga ggtgtctgtg 
1021 ggtgacagcc tcatcctcac ggtccatgca gatgcctacc ctagcataca gcattacaac
1081 tggacctacc taggtccatt ctttgaagac cagcgcaagc ttgagtttat cacccaaagg 
1141 gccatataca ggtacacatt caagctcttt ctgaaccgtg taaaggcctc agaggcgggc 
1201 cagtacttct taatggcaca aaacaaggca ggctggaata atctgacctt tgagctcacc
1261 ctgcgatatc ccccagaggt cagtgttaca tggatgcctg tgaatggctc tgatgtcctg 
1321 ttctgtgacg tctctgggta ccctcagccc agcgtgacat ggatggagtg caggggccac
1381 accgataggt gtgatgaagc ccaggctttg cacctttgga atgacaccca ccctgaagtc 
1441 ctgagtcaga agcccttcga caaagtgatc attcagagcc agctgcccat tggaccatta 
1501 aaacacaaca tgacttattt ttgcaaaacc cacaacagtg tgggtaacag ctctcagtac 
1561 ttcagggccg tctccctagg acaaagcaag cagctccccg atgagtccct cttcactccg 
1621 gtggtggtgg cctgtatgtc tgtcatgtct ctgctggtgc tactgctgtt gctgctcttg
1681 tacaagtaca agcagaagcc gaagtaccag gtgcgctgga agatcatcga gagatacgaa 
1741 ggcaatagct acaccttcat tgaccctact cagttgccct acaatgagaa gtgggagttc 
1801 cctcggaaca acctgcagtt tggtaagact ctaggagccg gtgcctttgg gaaggtggtg 
1861 gaggctacag cctttggtct gggcaaagaa gatgcagtgc tgaaggtggc tgtgaagatg 
1921 ctaaagtcca cggctcatgc tgatgagaag gaggccctga tgtcagagct gaagatcatg
1981 agtcacctgg gacagcacga gaatatagcc aacctcttgg gagcctgtac tcacggagga 
2041 cctgtcctgg tctacactga atactgctgc tatggagacc atctcaactt tctccgaagg 
2101 aaggccgagg ctatgctagg acccagcctg agtcctggtc aggactccga gggagactcc 
2161 agctacaaga acatccacct ggagaagaaa tatgtgcgca gggacagtgg cttctccagt 
35 2221 cagggtgtag acacctacgt ggagatgagg cctgtctcga cttcttcaag tgactccttc 
2281 tttaagcaag atctggacaa agagcacagc cggcccctgg agctctggga cctgctccac 
2341 ttctccagcc aagtggctca gggcatggcc ttccttgctt ctaaaaactg catccaccgg 
2401 gacgtagcag ctcgaaacgt gctgttgacc agcggacatg tggccaagat tggggacttt
2461 ggactggcta gggacatcat gaatgactcc aactatgttg tcaagggcaa tgcgctgcct 
40 2521 gtaaagtgga tggccccaga gagcatcttt gactgcgtca tcacagttca gagtgatgtg 
2581 tggtcctacg gcatcctcct ctgggagatc ttctcgcttg gtctgaaccc ctaccccggc 
2641 atccatgtga acaacaagtt ctacaaactg gtgaaggatg gataccaaat ggcccagcct 
2701 gtatttgcac cgaagaacat atacagcatc atgcagtcct gctgggacct ggagcctacc 
2761 agaagaccca ccttccaaca gatctgcttc ctcctccagg agcaggcccg actggagagg 
45 2821 agagaccagg actatgctaa cctgccaagc agcggtggca gcagcggcag tgacagtggt 
2881 ggtggcagca gcggtggcag cagcagtgag ccagaagagg agagctccag tgaacacctg 
2941 gcctgctgtg agccagggga catcgcccag cccctgctgc agcctaacaa ctaccagttc 
3001 tgctgaagtg ggagggagag ccgagtcctg ccgctctcta cgtcccgctt ggcctcctcc 
3061 atggcagggc gaacatgggg agaacatatg gacttcgccc tcagcttggc ccgactctga 
50 3121 cacttcagaa catgaggggt ctggggaggt cagaggcccc gtttgttccc agagcctggg 
3181 ccatcactgc cagtggggtt ctcacagtgc tagcctctat atttactatg ccaactggtg 
3241 cacccctagt tctctttctc catcctattc ccattttaaa aaacccgtcc caaactctcg 
3301 tgtttcaatg gaaagactga tttatgtctc aaaagacaag agtctcaaag gctgtgggta
3361 agctgaaggc ttgcctccct gacagatgct tagactacag gcttcttggg acaggtggcc 
3421 ccttcctaag ctcacaggag tggccaccac tcttgacctt cactctgtct atagtcccgc
3481 ctctgtactg agcgtgtact gagcgggcag ctaaaaagtg ttctacccag tgccctgtca 
3541 ctctagactg gaaggtatgg ggcctgatgc aaggctgacc acaccaacaa aaaccgtgtg 
3601 ctcctctcca agtctgactc gtcctcaatt aatctgtcaa catcaaacta acagtcatta
3661 acatc 

SEQ ID NO: 59
Amino acid sequence of mouse CSFR1 encoded by the DNA sequence shown in SEQ ID NO: 58.

MfiLGPPLVLLIATVWHGQGAPVIEPSQPELVVBPaETVTLRCVSNGSVBWDGPISPYW^ 

DPESPGSTLTTiWATFKNTOTYRCTBLBD^ 
5 QEAVLPCLITDPALKDSVSIjMRBGGRQVLRKTVYFFSPWRGFIIRKAKVI^ 

VHGR8STSTG I WLKVNRVHP EP PQ I KLEPS KLVR IRGRAAQI VCSATNAEVGFNVIL KRG 
DTKLB I PIjNSDFQDNYYIOCVRALSIiNAVDPQDAG I YSCVASNDVGTRTATMNFQVVES AY 
LNLTSEQSLLQEVSVGDSLILTVHADAYPSIQHYNWTYLG 

TFKL flnrvkas eagqyflmaqn kag wnnltfbltlr yp pbvsvt wmp vngsd vl fcd vs 
1 0 gypqpsvtwmecrghtdrctbaqalqvwndthpevlsqk^ 

yfckthnsvgnssqyfravslgqskqlpdeslftpvvvacmsvmsllvlllllllykykq 

kpkyqvrwki i er yegns ytf i d ptqlpynekwefprnnlqfgktlgagafgkwe ataf 

glgkedavlkwvkmlkstahadekeai^ 

tbyccygdu-nflrrkaeamlgpswpgqdsegdssykn^ 
1 5 yvemrpvstsssdsffkqdldkeasrplelwdlmfssqvaqgmaflaskj1cihrdvaar 

nvlltsghvakigdfglardimndshywkgnarlpvkwmapbsifdcvytv 

ILLWBIFSLGLNPYPGILVNNKFYKLVKDGYQMAQPVFAPKNIYSIMQSCWDLEPTO 
FQQICFIiLQEQARLERRDQDYANLPSSGGSSGSDSGGGS8GGSSSEPEEESSSEHLACCE 

PGDIAQPLLQPNNYQFC 

SEQ ID NO: 60

gi|34932021|ref|XM_225897.2| Rattus norvegicus similar to Macrophage colony stimulating
factor I receptor precursor (CSF-1-R) (Fms proto-oncogene) (c-fms) (LOC307403), mRNA

1 atgtgcaagg ctgttgtgaa cgctagggaa tccacctcca ttggcatccg gcttaaggtg
61 aatcgagccc acccagggcc cccacacatc atattggaac ctactaagct ggtgaggatt 
25 121 cgaggggagg ctgcccagat cgtgtgctcg gccactcact cagaagttga attcaacgtt 
181 atcctcaaac gtggagacac caagttggaa atccccataa acagtgactt ccaagacaac 
241 gcttataaaa aggtcctgac tcttaacctc aatgctgtgg acttccaaga tgctggcata 
301 tattcctgtg tggccaacaa cgcagctggc tcgaacacgg ccaccatgaa cttccaggtg 
361 gtggagagtg cctacttaaa cttgacctct gagcagagcc tcttgcagga ggtgtctgtg 
421 ggtgagaacc tcgacctcac agtcattgca gatgcctacc ctggcctaca gcgttacaac
481 tggacctacc tagggccgtt ctttgaagac ccacacaatc ttgagtttag aacccaatgg 
541 accacataca gctactcatt caaactccac ctgaaccgtg taaagccctt ggaggccggc 
601 cgctactcct taatggcaca aaacaaggca ggctggaata atctgacctt tgagctcacc 
661 ctgcgatacc ccccagaagt cagtgttaca tggatacctg tgaacggctc tgatgtcctg 
35 721 ctctgtgatg tctctgggta tcctcagccc aacgtgacat ggatggagtg caggggccac 
781 accgataggt gtgatgaggc ccaggcctcg caggtttggg atgacacaca acctgaagtc 
841 ctgagtcaga agcccttcca cagagtgatc cttcagagcc agctgcccat tgggacccta 
901 aagcacaaca tgacttatgt ttgcagagcc cacaacaatg tgggtaacag ctcccagttc 
961 ttcagggcca tctccctagg acaaagcaag cagctccctg atgagtacct gttcactcca 
40 1021 gtggtggtgg cctgtatatc tgtcatgtct ctgctggtac tactgctgct gctgctcttg 
1081 tacaagtaca agcagaagcc gaaatatcag gtgcgctgga agatcattga gagctacgag 
1141 ggcaacaact acaccttcat cgaccctacc cagttgccct acaatgagaa gtgggagttt 
1201 ccccggaaca acctgcaatt tggtgagact ctcggagctg gtgcctttgg gaaggtggtg 
1261 gaggccacag cctttggtct gggcaaagaa gatgcagtgc tgaaggtggc tgtgaagatg 
45 1321 ctcaagtcca cggctcatgc cgatgagaag gaggccctaa tgtcagaact gaagatcatg 
1381 agtcacctgg gacagcatga gaacattgtc aacctcttgg gagcctgtac tcatggaggg 
1441 cctgtcctgg tcatcaccga atactgctgc tacggagacc ttctcaactt ccttcgaagg
1501 aaggccgagg ctatgctggg acccagcctg agtcctggtc aggaccccga gggggactcc
1561 agctacaaga acatccacct ggagaagaaa tatgtgcgca gggacagtgg cttctccagt 
50 1621 cagggcgtag atacctatgt ggagatgagg cctgtttcga cttcctcaaa tgactccttc 
1681 tttaagcaag atctggacaa agaggccagc cggccgctgg agctctggga cctgcttcac 
1741 ttctctagcc aagtggctca gggcatggct ttccttgctt ctaaaaattg catccatcgt 
1801 gatgtagctg ctcgaaatgt gctgttgacc agcggacatg tggccaagat tggggacttt 
1861 gggctggcta gggacatcat gaatgactct aactatgttg tcaagggcaa tgcccgcctg 
55 1921 cctgtaaagt ggatggcccc agagagcatc ttcgactgcg tctacacagt tcagagtgat 
1981 gtgtggtcct acggcatcct cctctgggag attttctcac ttggtctgaa cccctaccca

2041 ggcatcttag tgaacaacaa gttctacaaa ctggtgaagg atggatacca aatggcccag 

2101 cctgtatttg caccggagaa catatacagc atcatgcagt cctgctggga cctggagcct 

2161 accaaaagac ctaccttcca gcagatctgc ttcctcctcc aggaacaggc ccgactggag 

2221 aggagagagc aggactatgc taacctgcca agcagcagca gcagcagtag cagcagcagt 

2281 gacagtggtg gtggcagtgg tggtagcagc agtgagcctg aagaggagag ctccagtgag 

2341 cacctggcct gctgtgagcc aggggacatc gcccagcccc tgctgcagcc taacaactac 

2401 cagttctgct gaagcgggac agcagagtcc tgctgccctc cacgtcccag ctcgacctcc 

2461 tccatggatg ggcgacatgg ggagagcata tgaacttcgt cctcagctcg gcccagctct 

2521 gacgcttctg aacatgaggg gttcccagag cctgggccat cactgccagt ggggttctca 

2581 cagtgctagc ctctatttac tacaccaact ggtgaaccca tacttcactt tcttcatcct 

2641 gttcccactt gaaaaaactg tccccaactc tcgtttcaat ggaaagactg atttgtgtct

2701 caaaaagaca ggtctcaggt tgtaggtgag cagaagcttg cctccctgac agaggctcag 

2761 actgcaggct tcttggggca ggcggccctt cccaagctca cagactggtc gccactctta 

2821 ccttctcttt atctacagtc ccgtctcatc ctggatcttg tacactagga gcccagtggc 

2881 agctgagagc cagggatgct ttacttagtg ccctgcattc taggctggca ggcatgggac 

2941 cttggtgcaa ggctgacaac gccaagcaaa tactgcgtgc tcctctccaa actgactcgt 
3001 cctcattaac agtcaacatt aaactaacag cattaacac 

SEQ ID NO: 61 

Amino acid sequence of rat CSFR1 encoded by the DNA sequence shown in SEQ ID NO: 60. 

MELQPPLVLLLATVWHGQGAPVIEPSGPBLVVBPGETVTLRCVSNGSVEWIX5PISPYWTL 
DPESPGSTLTTRNATFKWTGTYRCTELEDPMA^ 

QE AVLPCL I TD PALKD S VS LMREGGRQ VLRKTVY FFS AWRGF I I RKAKVLDSNTYVCKTM 
VNGRESTSTGIWLKVNRVHPEPPQIKLEPSKLVRIRGBAAQIVCSATNAEVGFNVILKRG 
DTIU*EI PLNSDFQDNYYKKNHUUiSLNAVDFQDAGI YSCVASNDVGTRTATMMFQVVESAy 
LNLTS EQSLXiQEVS VGDS LI LTVHADA YPS IQHYNWT YLGP FFEDQRKLEF ITQRAI YRY 
TFKLFLNRVKASEAGQYFLMAQNKAGWNNLTFELT^ 

GYPQPSVTWMECRGHTDROTEAQALQVWNDTHPEVLSQKPFDKVIIQSQLPIGTLKHNMT 
YFCKTHNSVGNSSQYFRAVSLGQSKQLPDESLFTPVWACMSVMSLXiVLLLLLLLYKYKQ 
KPKYQVRWKIIERYEGNSYTFIDPTQLPYNEKWEFPRmniQFG 

GI^KEDAVLKVAVKMLKSTAHADEKEALMSELKIMSHLGQHENIVNLtiGACTHGGPVLVI 
TEYCCYGDLLNFI.RRKAEAMLGPSLSPGQDSEGDSSYKNIHLEKKYVRRDSGFSSQGVDT 
YVEMRPVSTSSSDSFFKQDLDKEPSRPLELWDLLHFSSQVAQGMAFLASKNCIHRDVAAR 
NVLLTSGHVAKIGDFGkARDIMNDSNYWKGNARLPV 

ILLWEIFSLGLNPYPGILVNNKFYKLVKDGYQMAQPVFAPKNIYS1MQSCWDLEPTRRPT 
FQQICFIiLQEQARLERRDQDYANLPSSGGSSGSDSGGGSSGGSSSEPEEESSSEHLACCE 
PGDIAQPLLQPNNYQFAC 

SEQ ID NO: 62

gi|23110958|ref|NM_000396.2| Homo sapiens cathepsin K (pycnodysostosis) (CTSK),
mRNA

1 aaattttcca gccgatcact ggagctgact tccgcaatcc cgatggaata aatctagcac
61 ccctgatggt gtgcccacac tttgctgccg aaacgaagcc agacaacaga tttccatcag 
121 caggatgtgg gggctcaagg ttctgctgct acctgtggtg agctttgctc tgtaccctga 
181 ggagatactg gacacccact gggagctatg gaagaagacc cacaggaagc aatataacaa 
241 caaggtggat gaaatctctc ggcgtttaat ttgggaaaaa aacctgaagt atatttccat 
301 ccataacctt gaggcttctc ttggtgtcca tacatatgaa ctggctatga accacctggg 
361 ggacatgacc agtgaagagg tggttcagaa gatgactgga ctcaaagtac ccctgtctca 
421 ttcccgcagt aatgacaccc tttatatccc agaatgggaa ggtagagccc cagactctgt 
481 cgactatcga aagaaaggat atgttactcc tgtcaaaaat cagggtcagt gtggttcctg 
541 ttgggctttt agctctgtgg gtgccctgga gggccaactc aagaagaaaa ctggcaaact 
601 cttaaatctg agtccccaga acctagtgga ttgtgtgtct gagaatgatg gctgtggagg 
661 gggctacatg accaatgcct tccaatatgt gcagaagaac cggggtattg actctgaaga 
721 tgcctaccca tatgtgggac aggaagagag ttgtatgtac aacccaacag gcaaggcagc 
781 taaatgcaga gggtacagag agatccccga ggggaatgag aaagccctga agagggcagt
841 ggcccgagtg ggacctgtct ctgtggccat tgatgcaagc ctgacctcct tccagtttta
901 cagcaaaggt gtgtattatg atgaaagctg caatagcgat aatctgaacc atgcggtttt
961 ggcagtggga tatggaatcc agaagggaaa caagcactgg ataattaaaa acagctgggg
1021 agaaaactgg ggaaacaaag gatatatcct catggctcga aataagaaca acgcctgtgg
1081 cattgccaac ctggccagct tccccaagat gtgactccag ccagccaaat ccatcctgct
1141 cttccatttc ttccacgatg gtgcagtgta acgatgcact ttggaaggga gttggtgtgc
1201 tatttttgaa gcagatgtgg tgatactgag attgtctgtt cagtttcccc atttgtttgt
1261 gcttcaaatg atccttccta ctttgcttct ctccacccat gacctttttc actgtggcca
1321 tcaggacttt ccctgacagc tgtgtactct taggctaaga gatgtgacta cagcctgccc
1381 ctgactgtgt tgtcccaggg ctgatgctgt acaggtacag gctggagatt ttcacatagg
1441 ttagattctc attcacggga ctagttagct ttaagcaccc tagaggacta gggtaatctg
1501 acttctcact tcctaagttc ccttctatat cctcaaggta gaaatgtcta tgttttctac
1561 tccaattcat aaatctattc ataagtcttt ggtacaagtt tacatgataa aaagaaatgt
1621 gatttgtctt cccttctttg cacttttgaa ataaagtatt tatctcctgt ctacagttta
1681 ataaatagca tctagtacac at

SEQ ID NO: 63

Amino acid sequence of human CTSK encoded by the DNA sequence shown in SEQ ID NO: 62.

MWGLKVLLLPVVSFALYPEEILDTHWELWKK^
NLEASLGVHTYELAMNHLGDMTSE

YRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGG
YMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTG

RVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGE
NWGNKGYILMARNKNNACGIANLASFPKM

SEQ ID NO: 64

Amino acid sequence of human CTSK, a soluble active secreted form derived from SEQ ID NO: 63.

THWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTS
EEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFS
SVGALEGQLKKKTGKLLNLSPQNLVDCVSEND^

VGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGV
YYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANL
ASFPKM

SEQ ID NO: 65

gi|31982432|ref|NM_007802.2| Mus musculus cathepsin K (Ctsk), mRNA

1 gagccacgct tcctatccga aaagagccta gcgaacagat tctcaacagc aggatgtggg
61 tgttcaagtt tctgctgcta cccatggtga gctttgctct gtctccggag gaaatgctgg
121 acacccagtg ggagctatgg aagaagactc accagaagca gtataacagc aaggtggatg
181 aaacccctcg gcgtttaatt tgggagaaaa acctgaagca aatctctgcc cataacctgg
241 aggcctctct tggtgtccat acatatgaac tggccatgaa ccacttggga gacatgacca
301 gtgaagaagt ggttcagaag atgacgggac tcagaatacc tccctctcga tcctacagta
361 atgacactct ctatacccca gagtgggaag gcagggtccc agactccatc gactatcgaa
421 agaaaggata cgttactcca gtcaagaacc agggccagtg tggttcctgt tgggccttca
481 gctctgccgg ggccctggag ggccaactca agaagaaaac tggtaaactc ttagctctga
541 gtccccagaa tcttgtggac tgtgtgactg agaattatgg ctgtggaggc ggctatatga
601 ccactgcctt ccaatacgtg cagcagaacg gaggcattga ctctgaagat gcttacccat
661 atgtgggcca ggatgaaagt tgtatgtata acgccacggc aaaggcagct aaatgcagag
721 ggtacagaga gattcctgtg gggaacgaga aagccctgaa gagagcagtg gcgcgggtag

781 gacccatctc tgtgtccatc gatgcaagct tggcatcttt ccagttttac agcagaggtg 

841 tgtactatga tgaaaattgt gaccgtgata atgtgaacca tgcagtgttg gtggtgggct 

901 atggcaccca gaagggaagc aagcactgga taattaaaaa cagctgggga gagagctggg 

5 961 gaaacaaagg atatgctctc ttggctcgga ataagaacaa cgcctgcggc attaccaaca 

1021 tggccagctt ccccaagatg tgattccagc cagccagccc atctcttctc agattccttc 

1081 cttcatggtg caagatattg gtggctttgg aagggagtgg gcatggcgct cctgagaggg 

1141 acgcagcgat gctaactaag attgtttcat ttctcctctc gttggtgctt ccagtgacaa 

1201 ctctacttcc cttctctctg cccagggccc ttttctttgt ggacacaaca gggcattttt 

10 1261 ctgagagttg tggactctgt gctggtagac attggagtcc tccagcaggc tggaggacta 

1321 aggtgacctt cccgagcccc tgtcttctgt atacaccagt gaacatttca gtcttccact 

1381 gagatgcaca aatctattcg tgattctttg acaaatttag atgatattaa aaaaagtgtt

1441 tttccttctt tgtatttgaa ataaagtatc tcatttacaa ttt 

SEQ ID NO: 66 

Amino acid sequence of mouse CTSK encoded by the DNA sequence shown in SEQ ID NO: 65.

MWVFKFLLLPMVSFALSPEEMLDTQW

^EASLGVHTYELAMNHLGDMTSEEVVQKMTGLRIPPSRSYSNDTLYTPEWEGRVPDSID
YRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTENYGCGGG
YMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALKRAVA
RVGPISVSIDASLASFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGE
SWGNKGYALLARNKNNACGITNMASFPKM

SEQ ID NO: 67 

gi|13928757|ref|NM_031560.1| Rattus norvegicus cathepsin K (Ctsk), mRNA

1 cttgtctgaa aagagcatag acaacagatt ctcaacagca ggatgtgggt gttcaagttt

61 ttgctgctac ccgtggtgag ctttgctcta tccccggagg aaacgctgga cacgcagtgg 
121 gagctgtgga agaagaccca cgggaagcag tacaacagca aggtggatga aatctctcgg 
181 cgtttaattt gggaaaaaaa cctgaagaaa atttctgtcc ataatcttga ggcctctctt 
241 ggtgcccata cgtatgagct ggccatgaat cacctgggag acatgaccag cgaagaagtg 

30 301 gttcagaaga tgactggact cagagtgcca ccttcgcgtt ccttcagtaa tgacactctc 
361 tataccccag agtgggaagg cagagtccca gactccatcg actatcgaaa gaaaggctat 
421 gttactccag tcaaaaacca gggccagtgt ggttcctgtt gggctttcag ctctgcgggt 
481 gccctggagg gccaactcaa gaagaaaact ggcaaactct tagctctgag tccccagaat 
541 cttgtggact gtgtgtctga gaactatggc tgtggaggcg gctatatgac cactgccttc 

35 601 caatatgtgc agcagaatgg aggcattgac tctgaagacg cttacccgta tgtggggcag 
661 gatgaaagtt gtatgtataa cgccacggca aaggcagcta agtgcagagg gtacagagag 
721 atccctgtgg ggaacgagaa agccctgaag agagcagtgg ctcgggtagg acccgtctct 
781 gtgtccatcg atgcaagctt gacatctttc caattttaca gcagaggtgt gtactatgac 
841 gaaaactgcg accgtgataa tgtgaaccat gccgtgttgg tggtgggcta tggcacccag 

40 901 aagggaaata agtactggat aattaaaaac agctggggag aaagctgggg aaacaaaggc 
961 tatgttctct tggctcggaa taagaacaac gcctgtggca ttaccaacct ggccagcttc
1021 cccaagatgt gattcaagcc agccagccca tctgttctca cattccttcc tcaacagtgc 
1081 aagcgaacgg tggctttgga gtgacacctc tgcttccctt ctctccaccc aaggcccttt 
1141 tctttgtgga cacaactggg cattttctga gagttgtggc ctctgtgctg atagacgctg 

45 1201 gagtcctcca gcaggctgga gyactaaggt gaccttccca agcccctg 

SEQ ID NO: 68 

Amino acid sequence of rat CTSK encoded by the DNA sequence shown in SEQ ID NO: 67.

MWVFKFLLLPVVSFALSPEETLDTQWELW
NLEASLGAHTYELAMNHLGDMTSEEVV

YRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALS

YMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALKRAVA
RVGPVSVSIDASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGE
SWGNKGYVLLARNKNNACGITNLASFPKM

SEQ ID NO: 69

gi|4503174|ref|NM_003467.1| Homo sapiens chemokine (C-X-C motif) receptor 4 (CXCR4),
mRNA

1 gtttgttggc tgcggcagca ggtagcaaag tgacgccgag ggcctgagtg ctccagtagc 
10 61 caccgcatct ggagaaccag cggttaccat ggaggggatc agtatataca cttcagataa 
121 ctacaccgag gaaatgggct caggggacta tgactccatg aaggaaccct gtttccgtga 
181 agaaaatgct aatttcaata aaatcttcct gcccaccatc tactccatca tcttcttaac 
241 tggcattgtg ggcaatggat tggtcatcct ggtcatgggt taccagaaga aactgagaag 
301 catgacggac aagtacaggc tgcacctgtc agtggccgac ctcctctttg tcatcacgct 
15 361 tcccttctgg gcagttgatg ccgtggcaaa ctggtacttt gggaacttcc tatgcaaggc 
421 agtccatgtc atctacacag tcaacctcta cagcagtgtc ctcatcctgg ccttcatcag 
481 tctggaccgc tacctggcca tcgtccacgc caccaacagt cagaggccaa ggaagctgtt 
541 ggctgaaaag gtggtctatg ttggcgtctg gatccctgcc ctcctgctga ctattcccga 
601 cttcatcttt gccaacgtca gtgaggcaga tgacagatat atctgtgacc gcttctaccc 
20 661 caatgacttg tgggtggttg tgttccagtt tcagcacatc atggttggcc ttatcctgcc 
721 tggtattgtc atcctgtcct gctattgcat tatcatctcc aagctgtcac actccaaggg 
781 ccaccagaag cgcaaggccc tcaagaccac agtcatcctc atcctggctt tcttcgcctg 
841 ttggctgcct tactacattg ggatcagcat cgactccttc atcctcctgg aaatcatcaa 
901 gcaagggtgt gagtttgaga acactgtgca caagtggatt tccatcaccg aggccctagc 
25 961 tttcttccac tgttgtctga accccatcct ctatgctttc cttggagcca aatttaaaac 
1021 ctctgcccag cacgcactca cctctgtgag cagagggtcc agcctcaaga tcctctccaa 
1081 aggaaagcga ggtggacatt catctgtttc cactgagtct gagtcttcaa gttttcactc 
1141 cagctaacac agatgtaaaa gacttttttt tatacgataa ataacttttt tttaagttac 
1201 acatttttca gatataaaag actgaccaat attgtacagt ttttattgct tgttggattt 
30 1261 ttgtcttgtg tttctttagt ttttgtgaag tttaattgac ttatttatat aaattttttt 
1321 tgtttcatat tgatgtgtgt ctaggcagga cctgtggcca agttcttagt tgctgtatgt 
1381 ctcgtggtag gactgtagaa aagggaactg aacattccag agcgtgtagt gaatcacgta 
1441 aagctagaaa tgatccccag ctgtttatgc atagataatc tctccattcc cgtggaacgt 
1501 ttttcctgtt ctcaagacgt gattttgctg tagaagatgg cacttataac caaagcccaa 
1561 agtggtatag aaatgctggt ttttcagttt tcaggagtgg gttgatttca gcacctacag
1621 tgtacagtct tgtattaagt tgttaataaa agtacatgtt aaacttactt agtgttatg 



SEQ ID NO: 70 

Amino acid sequence of human CXCR4 encoded by the DNA sequence shown in SEQ ID NO: 69.



MEGISIYTSDNYTEEMGSGDYDSMKEPCFREENANFNKIFLPTIYSIIFLTGIVGNGLVI
LVMGYQKKLRSMTDKYRLHLSVADLLFVITLPFWA

YSSVLILAFISLDRYLAIVHATNSQRPRKLLAEKVVYVGVWIPALLLTIPDFIFANVS^
DDRYICDRFYPNDLWVVVFQFQHIMVGLILPGIVILSCYCIIISKLSHSKGHQKRKALKT
TVILILAFFACWLPYYIGISIDSFILLEIIKQGCEFE^

LYAFLGAKFKTSAQHALTSVSRGSSLKILSKGKRGGHSSVSTESESSSFHSS

SEQ ID NO: 71 

gi|2632100|emb|Z80112.1|MMLCR12 Mus musculus lcr-1 gene
1 atggaaccga tcagtatata cacttctgat aactactctg aagaagtggg gtctggagac
61 tatgactcca acaaggaacc ctgcttccgg gatgaaaacg tccatttcaa taggatcttc
121 ctgcccacca tctacttcat catcttcttg actggcatag tcggcaatgg attggtgatc 
181 ctggccatgg gttaccagaa gaagctaagg agcatgacgg acaagtaccg gctgcacctg 
5 241 tcagtggctg acctcctctt tgtcatcaca ctccccttct gggcagttga tgccatggct 
301 gactggtact ttgggaaatt tttgtgtaag gctgtccata tcatctacac tgtcaacctc 
361 tacagcagcg ttctcatcct ggccttcatc agcctggacc ggtacctcgc cattgtccac
421 gccaccaaca gtcaaaggcc aaggaaactg ctggctgaaa aggcagtcta tgtgggcgtc 
481 tggatcccag ccctcctcct gactatacct gacttcatct ttgccgacgt cagccagggg 
10 541 gacatcagtc agggggatga caggtacatc tgtgaccgcc tttaccccga tagcctgtgg 
601 atggtggtgt ttcaattcca gcatataatg gtgggtctca tcctgcccgg catcgtcatc 
661 ctctcctgtt actgcatcat catctctaag ctgtcacact ccaagggcca ccagaagcgc 
721 aaggccctca agacgacagt catcctcatc ctagctttct ttgcctgctg gctgccatat 
781 tatgtgggga tcagcatcga ctccttcatc cttttgggag tcatcaagca aggatgtgac 
841 ttcgagagca ttgtgcacaa gtggatctcc atcacagagg ccctcgcctt cttccactgt
901 tgcctgaacc ccatcctcta tgccttcctc ggggccaagt tcaaaagctc tgcccagcat 
961 gcactcaact ccatgagcag aggctccagc ctcaagatcc tttccaaagg aaagcggggt 
1021 ggacactctt ccgtctccac ggagtcagaa tcctccagtt ttcactccag ctaaccctta 
1081 tgcaaagact tatataatat atatatatat attgataaag acttttttat gttacaccat 
20 1141 ttttccagat ataagagact gaccagtctt gtacagtttt 

SEQ ID NO: 72

Amino acid sequence of mouse CXCR4 encoded by the DNA sequence shown in SEQ ID 
NO: 71. 

MEPISIYTSDNYSEEVGSGDYDSNKEPCFRDENVHFNRIFLPTIYFIIFLTGIVGNGLVI
LVMGYQKKLRSMTDKYRLHLSVADLLFVITLPFWAVDAMADWYFGKFLCKAVHIIYT^
YSSVLILAFISLDRYLAIVHATNSQRPRKLLAEKAVYVGVWIPALLLTIPDFIFADVSQG
DISQGDDRYICDRLYPDSLWMVVFQFQHIMVGLILPGIVILSCYCIIISKLSHSKGHQKR
KALKTTVILILAFFACWLPYYVGISIDSFILLGVIKQGCDFESIVHKWISITEALAFFHC
CLNPILYAFLGAKFKSSAQHALNSMSRGSSLKILSKGKRGGHSSVSTESESSSFHSS

30 SEQ ID NO: 73 

gi|17902280|gb|AF452185.1|AF452185 Rattus norvegicus strain Holtzman chemokine 
receptor CXCR4 (Cxcr4) gene, complete cds 

1 atggaaatat acacttcgga taactactcc gaagaagtag ggtctggaga ctatgactcc 
61 aacaaggaac cctgcttccg ggatgaaaac gaaaacttca acaggatctt cctgcccacc 

35 121 atctatttta tcatcttctt gactggcata gtgggcaatg ggttggtaat cctggtcatg 
181 ggttaccaga agaagctgag gagcatgaca gacaagtacc ggctgcacct gtccgtggct 
241 gacctcctct ttgtcatcac actccccttc tgggcagtgg acgccatggc tgactggtac 
301 tttgggaaat ttttatgtaa ggctgtgcat atcatctaca ccgtcaacct ttacagcagt 
361 gttctcatcc tggccttcat cagcctggac cgctaccttg ccattgtcca cgccaccaac 

40 421 agccagaggc cgaggaagct gctggctgaa aaggccgtct atgtgggtgt ctggatcccc 
481 gccctcctcc tgactatccc tgacatcatc ttcgccgatg tcagccaggg ggacggcagg 
541 tacatctgtg accgccttta ccccgacagc ctgtggatgg tggtgttcca gttccagcac 
601 atcatggtgg gtctcatcct gccgggcatc gtcatcctgt cctgttactg catcatcatc 
661 tccaagctgt cacactccaa gggccaccag aagcgcaagg ccctcaagac tacggtcatc 

45 721 cttatcctgg ctttctttgc ctgctggcta ccgtattacg tggggatcag catcgattcc 
781 ttcatccttt tggaggtcat caagcaagga tgtgagttcg agagcgtcgt gcacaagtgg 
841 atctccatca cggaggccct cgccttcttc cactgttgcc tgaaccccat cctctacgcc 
901 ttcctcgggg ccaaattcaa gagctccgcg cagcatgcac tcaattccat gagcagaggc 
961 tccagcctca agatcctttc caaagggaaa cggggtggac actcttccgt ctccacagag 

1021 tcagaatcct caagttttca ctccagctaa

SEQ ID NO: 74 
Amino acid sequence of rat CXCR4 encoded by the DNA sequence shown in SEQ ID NO: 73.

MEIYTSDNYSEEVGSGDYDSNKEPCFRDENENFNRIFLPTIYFIIFLTGIVGNGLVILVM
GYQKKLRSMTDKYRLHLSVADLLFVITLPFWAVDAMADWYFGKFLCKAVHIIYTVNLYSS
VLILAFISLDRYLAIVHATNSQRPRKLLAEKAVYVGVWIPALLLTIPDIIFADVSQGDGR
YICDRLYPDSLWMVVFQFQHIMVGLILPGIVILSCYCIIISKLSHSKGHQKRKALKTTVI
LILAFFACWLPYYVGISIDSFILLEVIKQGCEFESVVHKW^^
FLGAKFKSSAQHALNSMSRGSSLKILSKGKRGGHSSVSTESESSSFHSS

SEQ ID NO: 75 

gi|7524353|ref|NM_013974.1| Homo sapiens dimethylarginine dimethylaminohydrolase 2
(DDAH2), mRNA

1 ccgcttagac aatgccccgg agccgccaga ccgtcgcgcc cctgccccat cgtagtatat 
61 gagctcgcct acacaaggac ccccgctaaa agccagagct cccagtcccc gaggcttgaa 
121 gacggggact cccttctcca ccaactctgt cctcgggggg tggggcccca gccgagatca 
181 cagcgcgaca ggagtggggg tggccgctgg agacaggtga agaaacaaga aaactaagaa 
241 atccgagcgg ttggaggggg agtctgcgtg gatgggatgg ggacgccggg ggaggggctg
301 ggccgctgct cccatgccct gatccgggga gtcccagaga gcctggcgtc gggggaaggt 
361 gcgggggctg gccttcccgc tctggatctg gccaaagctc aaagggagca cggggtgctg 
421 ggaggtaaac tgaggcaacg actggggcta cagctgctag aactgccacc tgaggagtca 
481 ttgccgctgg gaccgctgct tggcgacacg gccgtgatcc aaggggacac ggccctaatc
541 acgcggccct ggagccccgc tcgtaggcca gaggtcgatg gagtccgcaa agccctgcaa 
601 gacctggggc tccgaattgt ggaaatagga gacgagaacg cgacgctgga tggcactgac 
661 gttctcttca ccggccggga gtttttcgta ggcctctcca aatggaccaa tcaccgagga 
721 gctgagatcg tggcggacac gttccgggac ttcgccgtct ccactgtgcc agtctcgggt 
781 ccctcccacc tgcgcggtct ctgcggcatg gggggacctc gcactgttgt ggcaggcagc
841 agcgacgctg cccaaaaggc tgtccgggca atggcagtgc tgacagatca cccatatgcc 
901 tccctgaccc tcccagatga cgcagctgct gactgcctct ttcttcgtcc tgggttgcct 
961 ggtgtgcccc ctttcctcct gcaccgtgga ggtggggatc tgcccaacag ccaggaggca 
1021 ctgcagaagc tctctgatgt caccctggta cctgtgtcct gctcagaact ggagaaggct 
1081 ggcgccgggc tcagctccct ctgcttggtg ctcagcacac gcccccacag ctgagggcct 
1141 ggccttgggg tactgctggc caggggtagg atagtatagg aagtagaagg ggaaggaggg 
1201 ttagatagag aatgctgaat aggcagtagt tgggagagag cctcaatatt gggggagggg 
1261 agagtgtagg gaaaaggatc cactgggtga atcctccctc tcagaaccaa taaaatagaa 
1321 ttgacctttt aaaaaaaaaa aaaaaaaaaa a 

SEQ ID NO: 76 

Amino acid sequence of human DDAH2 encoded by the DNA sequence shown in SEQ ID 
NO: 75. 

MGTPGEGLGRCSHALIRGVPESLASGEGAGAGLPALDLAKAQREHGVLGGKLRQRLGLQL
LELPPEESLPLGPLLGDTAVIQGDTALITRPWSPARRPEVDGVRKALQDLGLRIVEIGDE
NATLDGTDVLFTGREFFVGLSKWTNHRGAEIVADTFRDFAVSTVPVSGPSHLRGLCGMGG
PRTVVAGSSDAAQKAVRAMAVLTDHPYASLTLPDDAAADCLFLRPGLPGVPPFLLHRGGG
DLPNSQEALQKLSDVTLVPVSCSELEKAGAGLSSLCLVLSTRPHS

SEQ ID NO: 77 

gi|7949034|ref|NM_016765.1| Mus musculus dimethylarginine dimethylaminohydrolase 2
(Ddah2), mRNA

1 cgcccccgcc ccatcgtcgt acgtgagctc tcctaccctg ggaacccctc aaaagcccga 
61 gctatccgtc tctccagctt gaggactcag actcccatct tcattaactc tctcctctgg
121 gggtggggcc aaggctgaaa tcacaaggct gcaggagtgg aggtggccgc tgaccagtga
181 agtcaagaga aaaaacaaaa cagcttggct ggaaggggct tcggtgtgga tgtgatgggg
241 acgccggggg aggggctggg tcgttgttcc catgccctga tccggggtgt ccccgagagc
301 ttggcatccg gggaaggtgc tggcgctggt cttccggctc tggacctggc taaagctcaa
361 agggagcatg gagtactagg aggtaaactg aggcaacgac tagggctgca gctgcttgaa
421 ctgcctcctg aggagtcact gccgctggga ccactgcttg gtgacacggc tgtgatccaa
481 ggagacacgg ccctaatcac aaggccctgg agcccagcac gtaggcctga ggttgatgga
541 gtgcgtaaag ccctccagga ctcgggactc cgaattgtgg agatgggaga tgagaatgcg
601 acgctggacg gcaccgacgt cctcttcacc ggccgggagt ttttcgtagg cctctccaag
661 tggaccaatc atcgaggagc tgagatcgtg gcagacacgt tccgggactt cgctgtctca
721 acggtaccgg tctcaggctc ctcgcaccta cgcggcctct gtggcatggg gggacctcgc
781 accgtggtgg ctggaagcag cgaggctgcc caaaaagcag tcagggcaat ggcagcgctg
841 actgatcacc cctacgcctc tctgaccctc ccagatgatg cagctagtga ctgtctcttt
901 ctgcgtcctg ggttgcctgg tgccacacct ttcctcctgc accgcggagg ctctgcagaa
961 gctctctgat gtcaccctgg tacctgtgtc ctgctcagaa ctggagaagg ctggagctgg
1021 cctcagctcc ctctgcctgg tgctcagcac acgcccccac tgctgagggc ctggatttgg
1081 ggatcccact ggttaggaat agagctgtat agtgggtaga atcagctaat agaggctggg
1141 tagtcgtggg gagatgcccc aggataggga aggacttagt gtggaaaaga atagaagcca
1201 ttgggtgagt cctctctgtc aaaaccaata aaataaaatt gaccttttag ataaaaaaaa
1261 aaaaaaaaaa

SEQ ID NO: 78

Amino acid sequence of mouse DDAH2 encoded by the DNA sequence shown in SEQ ID NO: 77.

MGTPGEGLGRCSHALIRGVPESLASGEGAGAGLPALDLAKAQREHGVLGGKLRQRLGLQL
LELPPEESLPLGPLLGDTAVIQGDTALITRPWSPARRPEVDGVRKALQDLGLRIVEMGDE
NATLDGTDVLFTGREFFVGLSKWTNHRGAEIVADTFRDFAVSTVPVSGSSHLRGLCGMGG
PRTVVAGSSEAAQKAVRAMAALTDHPYASLTLPDDAASDCLFLRPGLPGATPFLLHRGGS
AEAL

SEQ ID NO: 79

gi|34852173|ref|XM_215315.2| Rattus norvegicus similar to Ddah2 protein (LOC294239),
mRNA

1 ggcttctgtg tggatgtgat ggggacgccg ggggaggggc tgggtcgctg ttcccatgcc
61 ctgatccggg gtgtccccga gagcttggca tccggggaag gtgctggcgc tggtcttccg
121 gctctggatc tggctaaagc tcaaagggag catggagtac taggaggtaa actgaggcaa
181 cgactaggtc tgcagcttct tgaactgcct cctgaggaat cactgccgct gggaccactg
241 cttggtgaca cggctgtgat ccaaggagac acggctctaa tcacaaggcc ctggagccca
301 gcgcgtaggc ctgaggttga tggagtccgc aaagctctcc aggacttggg gctcagaatt
361 gtggagatgg gggatgagaa cgctacgctg gacggcaccg acgtcctctt caccggccgg
421 gagtttttcg taggcctctc caagtggacc aatcatcgag gagctgagat cgtggcagac
481 acgttccggg acttcgctgt ctctacggta ccggtctctg gcgcctcgca tctgcgcggc
541 ctctgtggca tgggaggacc tcgcacggtg gtggctggaa gcagtgaggc tgcccaaaaa
601 gccgtcaggg caatggcagc actgactgat cacccctacg cctctctgac cctcccagat
661 gacgcagcga gtgactgtct ctttctgcgt cctgggttgc ctggtaccac acctttcctt
721 ctgcaccgcg gaggtgggga cctgcccaac agccaggagg ctctgcaaaa gctctctgac
781 gtcaccctgg tacctgtgtc ctgctcggaa ctggagaagg ttggagctgg cctcagctcc
841 ctctgcctgg tgctcagcac acgcccccac tgctgagggc tgggtttggg gctccaatta
901 gctaggaata gagccgtcta gggagtagaa tcaggtaata gaggctgggt agtcgtggga
961 gatgccccag gatagggaag gacttagtat gggacaaaga ctaggagcca gtgggtgagt
1021 cttctctgtc aaaaccaata aaataaaatt ggccttttag at

SEQ ID NO: 80

Amino acid sequence of rat DDAH2 encoded by the DNA sequence shown in SEQ ID NO: 79.

MGTPGEGLGRCSHALIRGVPESLASGEGAGAGLPALDLAKAQREHGVLGG
LELPPEESLPLGPLLGDTAVIQGDTALITRPWSPARRPEVDGVRKALQDLGLRIVEMGDE
NATLDGTDVLFTGREFFVGLSKWTNHRGAEIVADTFRDFAVSTVPVSG
PRTVVAGSSEAAQKAVRAMAALTDHPYASLTLPDDAASDCLFLRPG^^
DLPNSQEALQKLSDVTLVPVSCSELEKVGAGLSSLCLVLSTRPHC

SEQ ID NO: 81 

gi|8922242|ref|NM_018004.1| Homo sapiens hypothetical protein FLJ10134 (FLJ10134),
mRNA

1 gaagcacatc tggacagctg tgcggcctcc ttgcgggccg acgtcagccg agcacgtccc
61 ccacgtcctc tccttctcgc cacttattat ttattcgtct tcccaaagaa gcgactaggg 
121 acccaagttt aaaaattcct ccccccactc aatgcgagac gtggccagat cccatccaac 
181 acacggttta attttcatgg ggctctggga tcaaaagaac agaaacagca acaacaaaag 
15 241 cccagccgct gtctgatttt aagctggcaa agtgggaaaa ataaagtgtt gagtaaacag 
301 accaagttgg atcatgggga atttcagagg tcatgccctc cctggaacct ccttttttat 
361 tattggtctt tggtggtgta caaagagtat tctgaagtat atctgcaaaa agcaaaagcg 
421 aacctgctat cttggttcca aaacattatt ctatcgattg gaaattttgg agggaattac
481 aatagttggc atggctttaa ctggcatggc tggggagcag tttattcctg gagggcccca 
20 541 tctgatgtta tatgactata aacaaggtca ctggaatcaa ctcctgggct ggcatcattt 
601 caccatgtat ttcttctttg ggctgttggg tgtggcagat atcttatgtt tcaccatcag 
661 ttcacttcct gtgtccttaa ccaagttaat gttgtcaaat gccttatttg tggaggcctt 
721 tatcttctac aaccacactc atggccggga aatgctggac atctttgtgc accagctgct 
781 ggttttggtc gtctttctga caggcctcgt tgccttccta gagttccttg ttcggaacaa 
25 841 tgtacttctg gagctattgc ggtcaagtct cattctgctt caggggagct ggttctttca 
901 gattggattt gtcctgtatc cccccagtgg aggtcctgca tgggatctga tggatcatga
961 aaatattttg tttctcacca tatgcttttg ttggcattat gcagtaacca ttgtcatcge 
1021 tggaatgaat tatgctttca ttacctggtt ggttaaatct agacttaaga ggctctgctc 
1081 ctcagaagtt ggacttctga aaaatgctga acgagaacaa gaatcagaag aagaaatgtg 
30 1141 actttgatga gcttccagtt tttctagata aaccttttct tttttacatt gttcttggtt 
1201 ttgtttctcg atcttttgtt tggagaacag ctggctaagg atgactctaa gtgtactgtt 
1261 tgcatttcca atttggttaa agtatttgaa tttaaatatt ttctttttag ctttgaaaat 
1321 attttgggtg atactttcat tttgcacatc atgcacatca tggtattcag gggctagagt 
1381 gatttttttc cagattatct aaagttggat gcccacacta tgaaagaaat atttgtttta 
1441 tttgccttat agatatgctc aaggttactg ggcttgctac tatttgtaac tccttgacca
1501 tggaattata cttgtttatc ttgttgctgc aatgagaaat aaatgaatgt atgtactttg
1561 gtgc 

SEQ ID NO: 82 

Amino acid sequence of human DERP7 encoded by the DNA sequence shown in SEQ ID NO: 81.

MGNFRGHALPGTPFFIIGLWWCTKSILKYICKKQKRTCYLGSKTLFYRLEILEGITIVGM 
ALTGMAGEQFIPGGPHLMLYDYKQGHWNQLLGWHHFTMYFFFGLLGVADILCFTISSLPV
SLTKLMLSNALFVEAFIFYNHTHGREMLDIFVHQLLVLVVFLTGLVAFLEFLVRNNVLLE
LLRSSLILLQGSWFFQIGFVLYPPSGGPAWDLMDHENILFLTICFCWHYAVTIVIVGMNY
AFITWLVKSRLKRLCSSEVGLLKNAEREQESEEEM

SEQ ID NO: 83 

gi|31542277|ref|NM_019631.2| Mus musculus RIKEN cDNA C630002M10 gene
(C630002M10Rik), mRNA
1 ggccccgccg gggaaaccag atgaagcaca tctggacagc tgtgctgaga aagtttgtgg
61 gcctcttcca ggcctgccac ccgcaaggca cgtcccccac gtagcctcct gctagccact 
121 tactacttaa tttttttttc cccaagaagc aataagcaac ccacgcttga atctttttct 
181 ctccctcccc cccacccccc atgtgaggcg aggccacatc acattcaaca cagtttagtt 
5 241 ttcatggggc tttgagatca aaagaacaga aacagcaacc aaagctcagc tgccttctga 
301 tcctaactga caaagtgggg agagtaaggt gtgcgcaaac aggacaagtt gggtcatggg 
361 gagtttcaaa ggacatgctc tccctgggag tttcttcttc gccatgggct tttggtggac 
421 tatgaagaac atcctgaaat ctgtctacaa aaggcaaact cgaacctgct accttaactc
481 taaaacatta ttacgtcgga cagagatttg ggaaggagtt gttgtgcttt taatgtctct 
10 541 cactggtata gctggtgaac agtttatctc aggaggacct gccttgatct tgcataaaga 
601 tggccagtgg aaccagatcc tgggctggca tcacacaacc atgtacttat tctttgggct 
661 acagggtata acccaaatca tatgtttcac tactaatgta cttccacttt cctcaagcaa 
721 gttaatgtta tcaattgcca tctttgtgga gacatttatg ttctacaacc acacacacgg 
781 tcgggaaatg attgacattt ttgtacacca acttctggtc ttcgttggca cattttcggg 
841 tctggttgcc ttcttggagt tcctcgtaaa gaacaacgca cttctggagc tcctgcggtg
901 cagtctcctc atgtttcaag gaacctggtt ctggcagatg gcgtttgtgc tgtacccccc
961 tagtggaagt gctacatgga acctgtcaga tattcaaaat aaaatgtttc tctcaatgtg
1021 cttttgctgg cattatgcat caatccttat cctcattgga gtaaaatatg ctttggccaa 
1081 ctggttagtc aagtctaggc tgaggaaggg ctgcacctca gaagttggac tcctgaagca 
20 1141 tgctgaccgt gagcaagaat cagaagaaga agtatgatct tgaagtcttt cttgataagc 
1201 cttctccctt tgcgttgcct ttgttcatgg ctttgtttct tgacctctgg tctcaagaac 
1261 acttgtctga ggctgactcc atgctgtttg tacttccagt tttgttaaag tgttggactt 
1321 taagtatctt actttcagct ctgaaagaac catgagtgat aaattcactt tttacactgt 
1381 gcatgccatg taattcaaga ccaatcataa ttgttttcca aagtttagtt tcgtgtccat 
1441 ttattaaaaa tatttttttt attttccggg tagatacctt caagatttgt ggacttgcca
1501 tcactgtaat acatgacgtg ttgacttgta tttgtctatc ttgttgccac aatggaaaat 
1561 aaatgaatgc atgcaccttg gtgcagaaac caaaaatcct catttttttt ttcttagtaa 
1621 agtataccgc accctcacat gatacagaaa aaaatctgca tgtacaaaat tccatttctt 
1681 tgagactttt ctctatggag agcttgatgc aaaaggtaga gcagaagtct tcttctgcat 
1741 tttcaaaaag taatgatgtt gagacatata gtttcaaaga tgacagagac taggagagag
1801 gaattgtggt attcacataa cctagcgttg gtcactgtgc catttgctct gttaatgagg 
1861 taaaggatta aataacaagc ctttctacaa tcctaggtaa aaagtctttt tgccctgtgg 
1921 gccaagaaac tcccagatgt gtgtaaaaaa aaaattatgc atattactct ggccctactg 
1981 ttaaccactc tccatgtttc tcataattag ctcatctttc ttcttgtact tgatcttagt 
2041 taaaaggcca aaaggtggtt cttcactcta aattaaaggg gttaaaatga cttaataggc
2101 atatggaccc tttcttacta tcacatccta tgaatctcaa atggaaacaa gaagagaact 
2161 aaattaatac aatgttacac atcatggtgg acctgaagaa tgagatgaat ataatcttct 
2221 tgcccaaaaa ctgggagcag tcatcttgct acataagatt ttaaaaagac agatgtgagt 
2281 acccaaaaat atctcttgag ctgtaagtga gtgtactcca ggatacaagg taccgatatg 
2341 tcatggtcct ctgacgatca tgtgttctgc tcatgtctgt cttagattta aggactctat
2401 tttagaccaa caacattctg tgactgccac atttgagctt caaaggaacc aggaatcagc 
2461 ctcagctagt tgagacaagt cactgatatt gatgacagat aaggttacac ccggaagttt 
2521 gaaagccaat gaatccagat tttctgctgt ttttatgaga atacagagat cactacttct 
2581 ccaggttcaa acccagagaa tacaagtaaa cttcaaccca gggagttctc agaacatctg 
2641 agtctgagac cagttcagga atgcttccct acagtgtctg aaataaaaac ctttcttgc



SEQ ID NO: 84 

Amino acid sequence of mouse DERP7 encoded by the DNA sequence shown in SEQ ID 
NO: 83. 

MGS PKGHALPGS FFFAMGFWWTMKN ILKS WKRQTRTC

SLTGIAGEQFISGGPALILHKIX3QWNQILC»rainrrMYLFFGLQG 

S KLMLS IAI FVETFMFYNHTHGREMIDI FVHQLL VFVGTFSGLVAFLEFLVKNNALLELL 
RCSXiLMFQGTWFWQMAFVLYPPSGSATWNIiSDIQNKMFLSMCFCWHYASILILXG\ncyAL 
ANWLVKSRLRKGCTSEVGLLKHADREQESEBEV 

SEQ ID NO: 85
gi|34868010|ref|XM_340979.1| Rattus norvegicus similar to Dermal papilla derived protein 7
homolog (19.5) (LOC360708), mRNA

1 gccccgccgg gaaaccagat gaagcacatc tggacagctg tgccgagaaa gtttgtgggc 
61 cctttccagg cctgccacca gcaaggcacg tcccccacgt agcctcctgc tagccactta
121 ctacttaaat attttttttt cccaagaagc aatcagcaac ccaagcttga atccttttct
181 tctcccctat ttgaggcgag gccacatcac atcaacacag tttaattttc atggggcttt 
241 gcgatcaaaa gaacagaaac ggcaacaaaa gtccagctgc cttctgatcc taactgacaa 
301 agtggggacc cagtaaggtg tgagtaaaca ggccaagctg ggtcatgggg agtttcatag 
361 gtcatgctct ccctgggact ttcttcatca tgatgggctt ttggtggact acaaagaaca 
421 ttttgaaatc tgtttacaaa aaacacactc gaacctgcta tttgaattct aaaacattat
481 tacgtcgaat agagatttgg gaaggagttg ttgtggttat aatggctctt actggtatag 
541 ctggggaaca gtttatctcg ggaggacctg ccttgatctt gtataaagac ggccaatgga
601 accagatcct gggctggcat cacaccacca tgtacttctt ctttggtcta cagggtgtaa 
661 cccagatcgt atgttttact actaatgcac ttccgctttc cttaagcaag ttgatgttag 
721 cgaatgccat ctttgtggag acatttatct tctacaacca cacacatggt cgggaaatgg
781 ttgatatttt tgtacaccaa cttctgtcct acaccaccac ggcggcgggt ctagttgcct 
841 tcatggagtt cctcacaaag aacaatgtac ttctggagct cgtgaggtca agtctcatcc 
901 tattacaagg aacctggttc tggcaggttg cttttgttct gtaccctcct aaaggaagag 
961 ctacatggaa cctgtccgat attggaaata aaatgtttct ctcaatgtgc ttttgttggc 
1021 attatgcatc aattgtcatg ctcatcggag taatatttgc gttggccaac tggttagtta
1081 aatctagact taggaaggtc tgcacctcag aagttggact ccttaaacat gttgaccgtg 
1141 aacaagaatc agaagaagaa gtatga 

SEQ ID NO: 86

Amino acid sequence of rat DERP7 encoded by the DNA sequence shown in SEQ ID NO: 85. 

MaSFIGHALPGTFFIMMGFWWTTKN^^

ALTGIAGBQFISGGPALlLYKDGQWNQILGWHHTTMYFFFGLQaVTQIVCFTTNALPLSL 
SKLMLANAI FVETFI FYNHTHGREMVDI F VHQLLS YTTTAAGL VAFME FLTKNNVIiIjEL V 
RSSLILLQGTWFWQVAFVLYPPKGRATWNLSDIGNKMFLSMCFCWHYASIVMLIGVIFAL 
ANWLVKSRLRKVCTSEVGLLKHVDREQBSEEBV 

SEQ ID NO: 87

gi|13376090|ref|NM_024756.1| Homo sapiens elastin microfibril interfacer 3 (EMILIN3),
mRNA

1 aagacaacgt cactagcagt ttctggagct acttgccaag gctgagtgtg agctgagcct 
61 gccccaccac caagatgatc ctgagcttgc tgttcagcct tgggggcccc ctgggctggg 

121 ggctgctggg ggcatgggcc caggcttcca gtactagcct ctctgatctg cagagctcca
181 ggacacctgg ggtctggaag gcagaggctg aggacaccag caaggacccc gttggacgta 
241 actggtgccc ctacccaatg tccaagctgg tcaccttact agctctttgc aaaacagaga 
301 aattcctcat ccactcgcag cagccgtgtc cgcagggagc tccagactgc cagaaagtca 
361 aagtcatgta ccgcatggcc cacaagccag tgtaccaggt caagcagaag gtgctgacct 

421 ctttggcctg gaggtgctgc cctggctaca cgggccccaa ctgcgagcac cacgattcca
481 tggcaatccc tgagcctgca gatcctggtg acagccacca ggaacctcag gatggaccag 
541 tcagcttcaa acctggccac cttgctgcag tgatcaatga ggttgaggtg caacaggaac 
601 agcaggaaca tctgctggga gatctccaga atgatgtgca ccgggtggca gacagcctgc 
661 caggcctgtg gaaagccctg cctggtaacc tcacagctgc agtgatggaa gcaaatcaaa 

721 cagggcacga gttccctgat agatccttgg agcaggtgct gctaccccac gtggacacct
781 tcctacaagt gcatttcagc cccatctgga ggagctttaa ccaaagcctg cacagcctta 
841 cccaggccat aagaaacctg tctcttgacg tggaggccaa ccgccaggcc atctccagag 
901 tccaggacag tgccgtggcc agggctgact tccaggagct tggtgccaaa tttgaggcca 
961 aggtccagga gaacactcag agagtgggtc agctgcgaca ggacgtggag gaccgcctgc 

1021 acgcccagca ctttaccctg caccgctcga tctcagagct ccaagccgat gtggacacca
1081 aattgaagag gctgcacaag gctcaggagg ccccagggac caatggcagt ctggtgttgg 






1141 caacgcctgg ggctggggca aggcctgagc cggacagcct gcaggccagg ctgggccagc 
1201 tgcagaggaa cctctcagag ctgcacatga ccacggcccg cagggaggag gagttgcagt
1261 acaccctgga ggacatgagg gccaccctga cccggcacgt ggatgagatc aaggaactgt 
1321 actccgaatc ggacgagact ttcgatcaga ttagcaagge ggagcggcag gtggaggagc 
1381 tgcaggtgaa ccacacggcg ctccgtgagc tgcgcgtgat cctgatggag aagtctctga 
1441 tcatggagga gaacaaggag gaggtggagc ggcagctcct ggagctcaac ctcacgctgc 
1501 agcacctgca gggtggccat gccgacctca tcaagtacgt gaaggactgc aattgccaga 
1561 agctctattt agacctggac gtcatccggg agggccagag ggacgccacg cgtgccctgg 
1621 aggagaccca ggtgagcctg gacgagcggc ggcagctgga cggctcctcc ctgcaggccc
1681 tgcagaacgc cgtggacgcc gtgtcgctgg ccgtggacgc gcacaaagcg gagggcgagc 
1741 gggcgcgggc ggccacgtcg cggctccgga gccaagtgca ggcgctggat gacgaggtgg 
1801 gcgcgctgaa ggcggccgcg gccgaggccc gccacgaggt gcgccagctg cacagcgcct
1861 tcgccgccct gctggaggac gcgctgcggc acgaggcggt gctggccgcg ctcttcgggg 
1921 aggaggtgct ggaggagatg tctgagcaga cgccgggacc gctgcccctg agctacgagc 
1981 agatccgcgt ggccctgcag gacgccgcta gcgggctgca ggagcaggcg ctcggctggg 
2041 acgagctggc cgcccgagtg acggccctgg agcaggcctc ggagcccccg cggccggcag 
2101 agcacctgga gcccagccac gacgcgggcc gcgaggaggc cgccaccacc gccctggccg 
2161 ggctggcgcg ggagctccag agcctgagca acgacgtcaa gaatgtcggg cggtgctgcg 
2221 aggccgaggc cggggccggg gccgcctccc tcaacgcctc ccttgacggc ctccacaacg 
2281 cactcttcgc cactcagcgc agcttggagc agcaccagcg gctcttccac agcctctttg
2341 ggaacttcca agggctcatg gaagccaacg tcagcctgga cctggggaag ctgcagacca 
2401 tgctgagcag gaaagggaag aagcagcaga aagacctgga agctccccgg aagagggaca 
2461 agaaggaagc ggagcctttg gtggacatac gggtcacagg gcctgtgcca ggtgccttgg 
2521 gcgcggcgct ctgggaggca ggatcccctg tggccttcta tgccagcttt tcagaaggga 
2581 cggctgccct gcagacagtg aagttcaaca ccacatacat caacattggc agcagctact 
2641 tccctgaaca tggctacttc cgagcccctg agcgtggtgt ctacctgttt gcagtgagcg 
2701 ttgaatttgg cccagggcca ggcaccgggc agctggtgtt tggaggtcac catcggactc 
2761 cagtctgtac cactgggcag gggagtggaa gcacagcaac ggtctttgcc atggctgagc 
2821 tgcagaaggg tgagcgagta tggtttgagt taacccaggg atcaataaca aagagaagcc 
2881 tgtcgggcac tgcatttggg ggcttcctga tgtttaagac ctgaacccca gccccaatct 
2941 gatcagacat catggactcg cccagctctc ctcggcctgg ggctctggcc aaggatgggc 
3001 tggaggtcat tcagttggtc tgtctcttcc ctggaaacct tctgcaaaga tggtgtggtg 
3061 tacgtggctt ccctgtaacc acatggggct tggccatttc tccatgatga gaaggactgg 
3121 aatgcttctc cgggcaggac atggtcctag gaagcctgaa ccttggcttg gcatgccttc 
3181 tcagacagca cggcctgggc tccaactctt caccacaccc tgtattctac aacttctttg 
3241 gtgttttgct cctcctgtgg ttggaaactt ctgtacaaca ctttaaactt ttctcttgct 
3301 tcctcttctc ttctccctta tcgtatgata gaaagacatt cttccccagg aggaatgttt 
3361 aaaatggagg caacattttg gccaacattg gaaagcacta gagggcaatg ggattaaacc 
3421 aacctgcttg gtctctatta gtcagtaatg aagacgacag cctggccaac caagggaaag 
3481 gaaattagta tctttagttt cagtcattcc ttgtaggata tggtttagct gtgcccccac 
3541 ctaaaatatc atcttgaatt gtaatcccta taatccccac atcaagggag agatcaggtg 
3601 gaggtaattg gatcttgggg gcggttcccc catgctgttc ttgtgatagt tctcacgaga 
3661 tctgatgatt ttataagttt gatagttcct cctgtgttca ttctccttcc tgccaccttg 
3721 tgaagatgcc ttggttcctc ttcactgtct gccatgattg taagtttcct gaggcctccc 
3781 cagccatgtg gaacagtgag tcaattaaac ctctttcctt tataaatt 

SEQ ID NO: 88 

Amino acid sequence of human ENDOGLYX1 encoded by the DNA sequence shown in 
SEQ ID NO: 87. 

MILS LLFS IiGGPLGWGLLGAWAQAS STSLS DLQS SRTPG VWKAEAEDTSKDP VGRNWC P Y 
PMS KLVTLLALCICreKTL IH5QQPC PQGAPDC^ 

CCPG YTGPNCEHHDSMAI PEP ADPGDSHQ BPQDGP VS FKPGHLAAV I NBVEVQQBQQEHL 
LGDLQNDVHRVADSLPGLWKALPGN^ 

FSPIWRSFNQSLHSLTQAIRIO-SLDVEANRQAISRVQDSAVARADFQELGAKFEAKVQEN 
TQRVGQLRQDVEDRLHAQHFTIiHRS I SELQ ADVDTKL KMjHKAQEAPGTNGSLVLATPGA 
G ARPE PDSLQARLGQLQRNLS ELHhTTTAR^ I KELYSE SD 

ETFDQI SKVERQVBBLQVNHTALRELRVI LMEKSL IMEENKEE VERQLLELNLTLQHLQG 
OHADL I KYVKDCN CQKLYLDLD VI REGQRDATRALE ETQ VSLDERRQLDG S S LQ ALQNAV 
DAVSUVVDAHKAEGERARAATSRIJISQVQALDDBVGALKAAAAEARHEVRQLHSAFAALL
EPAlJ^HEAVLAALFOBEVLEEMSEQTPGPLPLSyEQIRVALQDA^ 
RVTALEQASEPPRPAEHLEPSHDAGREEAATTAIAGLARB 
AGAASLNASLDGLHNALFATQRSLEQHQRLraSLFGOT 
S GKKQQKDLE APRKRDKKEAE PLVD I RVTG P VPG ALGAALWEAGS PVAF YAS FS EGTAALQ 
TVKFNTTYINXGSSYFPBHGYFRAPERGVYLFAVSVEFGPGPGTGQLVFGGHHRTPVCTT 
GQGSGSTATVFAMAELQKGBRVWFELTQGSITKRSLSGTAFGGFLMFKT 


SEQ ID NO: 89 

gi|37620146|ref|NM_153127.2| Mus musculus elastin microfibril interfacer 3 (Emilin3),
mRNA



1 ttttctgaca tttcgcttga agaccacatc accagcaaat ttgagagtca cttgtaaggc 
61 tgagcatcag cacaggagcc cctcaccatg atcccgacac tgctgctggg ctttggggtg 
121 tacctgagct ggggactgct agggtcctgg gcacaggacc ccggtaccaa gttctcccat 
181 ctcaataggc ccggcatgcc tgaaggctgg agactagggg ctgaggatac cagcagagac 
241 cccatcagac ggaactggtg tccttaccag aagtccaggc tagtcacctt tgtagctgct
301 tgcaaaacag agaaattcct ggtccattca cagcagccat gtccacaggg agcccctgac 
361 tgccagggag tcagagtcat gtatcgagtg gcccagaagc cagtgtacca ggtccagcag 
421 aaggtgctga tctctgtgga ctggcggtgc tgcccagggt tccagggacc agactgccag 
481 gaccacaatc ccacagcaaa ccctgagccc acagagccaa gtggtaaact ccaggagact
541 tgggactcga tggatggctt tgaacttggt caccctgtcc cagagtttaa tgagattaag
601 gtgccacaag aacaacagga aaacctgctt caaaatctcc agaatgatgc ccagtcagta 
661 gaagatggct ttccaggctc ttgggaagcc ccacccagca acctcacaga tgagacgaca 
721 gaagccaatc taacagaatt cgagtttcct ggcaggacat cagagcacct gctgcagccc 
781 catattgatg cattcctgaa agcacacttc agtcccatct ggaagaactt caacgacagc 
841 ttgcacagcc tctcccaggc catcagaaac ttgtctcttg atgtggaggc caatcaccag
901 gccatcaaga tgatccagga gggcacagtg gctagggctg acttccaaga gcttggtgcc 
961 aagtttgagg ccaaggtcca gcagaatagc cagagactgg gccaactgtg gcaggatgtg 
1021 gaggaccagc tgcatgccca gcgccgatcg gtgcatcatg ccctctctga tgtccaggct 
1081 gaggtgagca ccaagttaaa gcagcttgtc aaggctcagg aacttccagg ggccaatggc 
1141 ggcctggtga tggcatctgc agcagcggca gcaaggccag agccagagag cctgcaggcc
1201 aggctagggc agctgcagag aaacctctct gctctgcaca tggtcactag ccagagggag 
1261 gaggagttgc agagcaccct caagaacatg gacagcgtcc tgaagcagca cgccgaagag 
1321 atcaaagagc tctattctga atcggatgag accttcgacc agatcagcaa ggtagagagg 
1381 caggtggagg agctgctggt gaaccacacc gggcttcgag agctgcgggt gatcctaatg 
1441 gaaaagtccc tgatcatgga ggagaacaaa gaggagatag agcggcaact gttggaactc
1501 aaccttaccc tgcagcatct gcatgcgggt catgcagacc tcattaagta tgtcaaggac 
1561 tgcaactgcc aaagggtcaa ctctgacgtg gatgtcgctc cggagggcca cagggatgtc 
1621 atgcacaccc tagaagagac ccaagtgagc ctggacgaac agcaccagct agacggttct 
1681 tctttgcagg ccctgcaaag cactgtagat gccatgtctt cagcaatgga tgcctataga 
1741 ggagagggtg aacgggcccg ggctgaaagg gcacggatac ggagccaact gcgggctctg
1801 gatcatgctg tggaagcgct gaagacagcg gcgaatggaa cccgcaaaga gatacgccta 
1861 ctgcacggct ccttcacagc cctgctggag gatgccctgc gacatcaggc cgtgctagct 
1921 gcactcttcg gggaggagat gatagacgag atgtcagagg aggcccctcg ccctctgcca 
1981 ctggattatg agcagatccg cctagccctg caggacgcgg ccagtgggct acaggaacag
2041 gcgattggtt gggaggactt ggccacccgg gtggaggcat tggagaaggc cgcaggtggc
2101 tttgtggagc agcacccaca gttggcagag ggacttgagc ccagccacga ctctgggaga 
2161 gaggaggaag ccatgacttt ggcggagctg gagcaggaga ttcggcgcct gagttctgat 
2221 gtcaagcaga ttgggcagtg ctgtgaggcc tcctgggctg cctccctcaa tagctccctt 
2281 gaagacctac acagcatgct cttggacacc cagcacggcc tgagacagca ccggcagctc
2341 ttccacaacc tcttccagaa cttccaaggt ctggtggcaa gcaacatcag cctagacttg
2401 gggaagctgc aggccatgtt gagtaagaaa gataagaagc aaccgagagg cccaggagaa 
2461 tcccggaaga gggataagaa gcaagtggtg atgtctacag atgcacacgc caaaggtctg 
2521 gagctctggg agacaggctc ccctgtggcc ttctatgccg gttcttcaga aggggccact 
2581 gctctgcaga tggtgaagtt caacaccaca tccatcaatg tgggcagcag ctactttcct 
2641 gaacatggct acttccgagc tcccaaacgt ggcgtctact tgtttgctgt gagcattaca
2701 tttggcccag gcccaggaat ggggcagctg gtatttgaag gtcatcaccg ggttccagtc 
2761 tacagtacgg aacagagggg cgggagcaca gcccaccact tttgctatgg tagagctaca
2821 aaagggtgag agagcgtggt ttgagttaat ccaagggtca gcaaccaaag ggagccaacc

2881 aggcactgca tttgggggct tcctgatgtt caagacctga accctgggct cggctcagct

2941 tgtatcagac atggtagatc cgcctgctgc tcttagactg aggctctggc cagcaaaggc 

3001 tggagaacat ctagtggcct agctcttccc tggacacctt ctgcaaagac cctgcggcca 

3061 gcagcacgta ccttcttctg gacacacagg ttggggaggc cagaatactg ctctctggac 

3121 tgggccgggg cctggagtgg gagctgggat ttccatgcct tccccaatgg cacggctcgg 

3181 cttgtctctg agacacttcc tcagttctac agcttttctt tttctctccc ttgtggttgg 

3241 aaacttgtac acttacgcct ttttcggttt ccacttttct gtgttagaaa gtcactgttt 

3301 ctcagaagga atgtctacag tgatgggttt gccatgcaga aaggtcccag attctctttc 
3361 caatgtc 

SEQ ID NO: 90 

Amino acid sequence of mouse ENDOGLYX1 encoded by the DNA sequence shown in SEQ 
ID NO: 89. 

MIPTLLLGFQVYLSWGLLGSWAQDPGTKFSHLNRPGMPEGWRLGAEDTSRDPIRRNWCPY 
OKSRLVTFVAACKTEKFLVHSQQPCPQGAPDCQGVRVMYRVAQKPVYQVQQKVLISVDWR 
CCPGPQGPDCQDHNPTANPBPTEPSGKLQETWDSMDGFELGHPVPBFNEIKVPQEQQBNL 
LONLQNDAQSVEDGFPGSWEAPPSNLTDEMTEANLTBFBFPGRTSEHLLQPHIDAFLKAH 
FSPIWKNFNDSLHSLSOAIWJLSLDVEANHQAIKMIQEGTVARADFQELGAKFEAKVQQN 
SQRLGQLWQDVEDQLHAQRRSVHHALSDVQAEVSTKLKQLV^^ 

AARPEPESLQARIjGQLQIUJLSALHMVrSQREEELQSTLKNMDSVLKQHAEEIKELYSBSD 
ETFDQISKVERQVEELLVNHTGLRELRVILMBKSLIMEENKEEIERQLliELNLTLQHLHA 
GHADLIKYVKDCNCQRWSDVDVAPEGHRPVMHTLEETQVSII)EQHQLOGSSLQALQSTV 
DAMS SAMDAYRGEGERARAERAR I RS QLRALDHAVE ALKTAANGTRKE I RLLHGS FTAX.L 
EDAIiRHQAVLAALFGEEMIDEMSEEAPRPLPLDYEQlRbAriQDAASGLQEQAIGWEDLAT 
RVEALEKAAGGFVEQHPQLAEGLEPSHDSGRBBEAMTLAEIiEQEIRRLSSDVKQlGQCCE 
AS WAAS l^SS LEDLHSMLLDTQHGLRQHRQLFHNLFQNFQGti VASN I S LDLGKLQAMLS K 
KDKKQPRGPGESRKRDKKQWMSTDAHAKGLELWBTGSPVAFYAGSSEGATAtQMVKFNT 
TS INVGSS YPPEHGYFRAPKRGVYLFAVS ITFGPGPGMGQLVFEGHHRVPVYSTEQRGGS 
TAHHFCYGRATKG 

SEQ ID NO: 91 

gi|27668203|ref|XM_224646.1| Rattus norvegicus similar to elastin microfibril interfacer 3;
EMILIN-like protein EndoGlyx-1 (LOC306288), mRNA

1 atgatcctga cactgctgct gggccttgcg gggtacctga gctggggact gctggggtcc 

61 tgggcacagg accctggttc caggttctcc aaccctaata ggctcagcgt gcctgaaggc 

121 tggaggatag gggctgacga cgataccagc agagacccca gccaacggaa ctggtgtcct 

181 taccagaagt ccaggctggt cacctttgta gctgcctgca aaacagagaa attcctggtt 

241 cattcacaac agccatgtcc gcagggcgct cctgactgcc agagagtcaa agtcatgtat 

301 cgagtggccc agaagccagt gtaccaggtc cagcagaagg tgctgatgtc tgtggactgg 

361 aggtgctgcc cagggttcca gggtccagac tgccaggacc atcatcccac agcaaacccc 

421 gagcccacag aggcaagtgg taaactccag gagacttggg actcattgga tggctttgaa 

481 cctggtcacc ctgtcacaga gtttaatgag attaaggcgc cacaggaaca cctgcttcaa 

541 aatctccaga atgacgccca gccggtagaa gatggctttg caggcccttg ggaggcccca 

601 tccagcaacc ccacagctgc aatgacagaa gccaatctga cagaatttga gtctcttggc 

661 aggacatcga aacatctgct gcagccccat atccatgcat tcctggaagc acacttcagt 

721 cccatctgga agagcttcaa cgagagctta cagagcccct tccaggccat cagaaatctg

781 tctcttgatg tggaggccaa tcaccgggcc atcaagatga tccaggaggg caccgtggct 

841 agggctgact tccgagagct tggtgccaag tttgaggcca aggtccagca gaatagccag 

901 agactgggcc agctgtggca ggatgtggag gaccagctgc atgcccagcg ccgatccgtg 

961 catcatgccc tctctgaggt ccaggctgag gtgagcacca agttaaagca gcttgttaaa 

1021 gctcaggaac ttccaggggc caacggcagc ctggcggtgg catctgcggc aaggccagag 

1081 ccagagagcc tgcaggccag actggggcag ctgcagagaa acctctccgc tctgcacatg 

1141 gtcactaacc agagggagga ggagttgcgg ggcaccctca aggacatgga cagcgtcctg 
1201 aggcagcaca cggatgaaat caaggagctc tattctgaat cggatgagac cttcgaccag
1261 atcagcaaag tagagaggca ggtggaggag ctgctggtga accacacggg gcttcgagag 
1321 ctacgagtga tcttaatgga gaagtccctg atcatggagg agaacaaaga ggagatggag 
1381 cggcaactct tggacctcaa cctcactctg cagcatctgc aggcagctca tgcagatctt 
1441 atcaaatatg tcaaggactg caactgccga agggtctact ccgacatgga tgtcatcccg
1501 gagggccgca gggatgccat gcacacccta gaggagaccc aagtaagccc ggacgaacag 
1561 caccagctag acggttcttt gcaggccttg caaagcactg tagacgccat gtcttcggca 
1621 ttggatgcct acagaggaga gggtgaacgg gcccgggctg agagggcccg gatgcggagc 
1681 cagctgcggg ccctggacca cgctgtggaa gcactgaaga ccgcggcgaa cgggacccgc 

1741 aaagagatac gcctgctgca tggctccttc gcagccctac tggaggatgc actgcgacat
1801 caggccgtgc tggccgcact cttcggggag gagatggtgg acgagatgtc ggaggagcct 
1861 cctcgccctc tgcccctgaa ttatgagcag atccgcctgg ccctgcagga cgccgccagt 
1921 gggctgcaag agcaggcggt tggttgggag gacttggcca ctcgggtgga ggctttggag 
1981 aaggccgcag gtggctttgt ggagcagcac ccacggtcgg cagagggact tgagcccagc 

2041 cacgactttg ggagagagga ggacaccagg gctttggggg acctggagca agagattcag
2101 cgcctgagct cggatgtcaa gcagatgggg cagtgctgcg aggcctcctg ggccctcaat 
2161 ggctcccttg aagacctaca cagcgcgctc tctgacaccc agcacagcct gagacagcac
2221 cagcagctct tccgtggcct cttccacaac ttccaagggc tgttggcaag caacaccagc 
2281 ctagacctgg ataagctgca ggccatgctg agtaagaaag acaagaagca acagaaaggc 

2341 ccgggagaat cccggaagag ggataagaaa caagtagtga tgtctgcaga tgccaaaggt
2401 ctggggctct gggaagcagg ttcccctgtg gccttctatg ccagttcttc agaagaggcc 
2461 accgctctgc agatggtgaa gttcaacgct acatccgtca acgtgggcag cggctacttc 
2521 cccgaacacg gctatttccg agctcccaaa cgtggcatct acctgtttgc agtgagcgtt 
2581 acatttggtc caggccccgg aatggggcag ctggtatttg aagggcatcg ccgggttcca 

2641 gtctacagtg cggaacagag gggtgggagc acagccacta cctttgctat ggcagagctg
2701 caaaagggtg agagggtgtg gtttgagtta attcaagggt cggtgacaaa ggggagccgg 
2761 ccaggcactg catttggagg cttcctgatg ttcaagacct ga 

SEQ ID NO: 92

Amino acid sequence of rat ENDOGLYX1 encoded by the DNA sequence shown in SEQ ID
NO: 91.

MILTLLLGLAGYLSWGLLQSWAQDPOSRFSKfPNRI*SVPBGWRI(5ADDDTSRDPSQRNWCP 
YQKSRLVTFVAACKTEKykVHSQQPCPQGAPDCQRVKVMTO 

RCCPGPQGPDCQDHHPTANPBPTBASGKLQBTWDSLDGPEPGHPVTEPMBIKAPQBHLLQ 
NLQNDAQPVEDGFAGPWBAPSSMLTAAMTEANLTEFESLGRTSKHLLQPHIHAFLEAHFS 

PIWKSFNESLQSLFQAll^SLDVBAjmRAIKMIQEGTVARADFRELGAKFEAKVQQNSQ
RLGQLWQDVEDQUIAQRRSVHHALSEVQAEVSTKLKOLVKAQELPGANGSLAVASAARPE 
PESLQARLGQLQRNLSALHMVTNQREEELRGTLKBMDSVLRQHTDEIKELVSESDETFDO 
I S KVERQVEELL VNHTGLRELRV I LME KSL I MEENKEEMERQLLDLNLTLQHLQAAHADL 
I KYVKDCNCRR VYSDMD V I PEGRRDAMHTLEETQV6 PDEQHQLDGSLQALQ STVD AMS S A 

IjDAYRGEGERARAERARMRSQLRALDHAVEALKTAANGTRKE I RLLHGSFAALLEDALRH
QAVIAALFGEENVDEMSEEPPRPLPLNYEQIRI^VLQDAAGGLQEQAVGWEDXATRVEALE 
KAAGGFVEQHPRSAEGLEPSHDFGREEDTRALGDLEQEIQRLSSDVKQMGQCCEASWALN 
GSLEDLHSAIiSDTQHSLRQHQQLFRGLFHNFQGLLASNTSLDLDKLQAMLSKKDKKQQKG 
PGESRKRDKKQVVMSADAKGI/^LWEAGSPVAFYASSSEEATALQNVKFNATSVNVGSGYF 

PEHGYFRAPKRGIYIiFAVSVTFGPGPGMGQLVFEGHRRVPVYSAEQRGGSTATTFAPlAEL
QKGBRVWFELIQGSVTKGSRPGTAFGGFLMFKT 

SEQ ID NO: 93 

gi|37540060|ref|XM_351265.1| Homo sapiens EGF, latrophilin and seven transmembrane
domain containing 1 (ELTD1), mRNA

1 atgcaccaat ttttctgcag atgtaaggaa gggctacagg gtgtcactct gggagctgcg

61 cctggggata aactgattcc tcagtggggc tgtaacattg ctcagggtca gtggtcacta
121 ggcagggaat ggagcccagg aggcgtgggg aaaggggatg gaggccggac gtggccagcg 
181 gattcagggc ttgactcaca cactcacaca aaagaggccg gaaagtgtcc ggaggaagcc
241 ggcgcgtcac ggaggacgga tccgggaccc tgccgcgccc gcccagcggc gcccggtccc 
301 gggtccacag ccgcactcac tccgccgcgc tctccgccac cgccaccact gcggccaccg 
361 ccaatgaaac gcctcccgct cctagtggtt ttttccactt tgttgaattg ttcctatact 
421 caaaattgca ccaagacacc ttgtctccca aatgcaaaat gtgaaatacg caatggaatt
481 gaagcctgct attgcaacat gggattttca ggaaatggtg tcacaatttg tgaagatgat 
541 aatgaatgtg gaaatttaac tcagtcctgt ggcgaaaatg ctaattgcac taacacagaa
601 ggaagttatt attgtatgtg tgtacctggc ttcagatcca gcagtaacca agacaggttt 
661 atcactaatg atggaaccgt ctgtatagaa aatgtgaatg caaactgcca tttagataat 
721 gtctgtatag ctgcaaatat taataaaact ttaacaaaaa tcagatccat aaaagaacct
781 gtggctttgc tacaagaagt ctatagaaat tctgtgacag atctttcacc aacagatata 
841 attacatata tagaaatatt agctgaatca tcttcattac taggttacaa gaacaacact 
901 atctcagcca aggacaccct ttctaactca actcttactg aatttgtaaa aaccgtgaat 
961 aattttgttc aaagggatac atttgtagtt tgggacaagt tatctgtgaa tcataggaga 
1021 acacatctta caaaactcat gcacactgtt gaacaagcta ctttaaggat atcccagagc
1081 ttccaaaaga ccacagagtt tgatacaaat tcaacggata tagctctcaa agttttcttt 
1141 tttgattcat ataacatgaa acatattcat cctcatatga atatggatgg agactacata 
1201 aatatatttc caaagagaaa agctgcatat gattcaaatg gcaatgttgc agttgcattt 
1261 gtatattata agagtattgg tcctttgctt tcatcatctg acaacttctt attgaaacct 
1321 caaaattatg ataattctga agaggaggaa agagtcatat cttcagtaat ttcagtctca
1381 atgagctcaa acccacccac attatatgaa cttgaaaaaa taacatttac attaagtcat 
1441 cgaaaggtca cagataggta taggagtcta tgtgcatttt ggaattactc acctgatacc
1501 atgaatggca gctggtcttc agagggctgt gagctgacat actcaaatga gacccacacc 
1561 tcatgccgct gtaatcacct gacacatttt gcaattttga tgtcctctgg tccttccatt 
1621 ggtattaaag attataatat tcttacaagg atcactcaac taggaataat tatttcactg
1681 atttgtcttg ccatatgcat ttttaccttc tggttcttca gtgaaattca aagcaccagg 
1741 acaacaattc acaaaaatct ttgctgtagc ctatttcttg ctgaacttgt ttttcttgtt 
1801 gggatcaata caaatactaa taagctcttc tgttcaatca ttgccggact gctacactac 
1861 ttctttttag ctgcttttgc atggatgtgc attgaaggca tacatctcta tctcattgtt 
1921 gtgggtgtca tctacaacaa gggatttttg cacaagaatt tttatatctt tggctatcta
1981 agcccagccg tggtagttgg attttcggca gcactaggat acagatatta tggcacaacc 
2041 aaagtatgtt ggcttagcac cgaaaacaac tttatttgga gttttatagg accagcatgc 
2101 ctaatcattc ttgttaatct cttggctttt ggagtcatca tatacaaagt ttttcgtcac 
2161 actgcagggt tgaaaccaga agttagttgc tttgagaaca taaggtcttg tgcaagagga 
2221 gccctcgctc ttctgttcct tctcggcacc acctggatct ttggggttct ccatgttgtg
2281 cacgcatcag tggttacagc ttacctcttc acagtcagca atgctttcca ggggatgttc 
2341 atttttttat tcctgtgtgt tttatctaga aagattcaag aagaatatta cagattgttc 
2401 aaaaatgtcc cctgttgttt tggatgttta agctgttgaa atgaagtctg ccaaatcttg 
2461 ctctaacaaa taaaatgtta tctaaatgaa 

SEQ ID NO: 94

Amino acid sequence of human ETL encoded by the DNA sequence shown in SEQ ID NO: 
93. 

MHQFFCRCKEOLQOVTLOAAPGDKLIPQWOCNIAQGQWSLQREWSPGQVOKODOGRTWPA 

DSGLDSHTHTKEAGKCPEEAGASRRTDPGPCRARPAAPOPGSTAALTPPRSPPPPPLRPP 
PMKRLPLLWPSTLLNCSYTQKCTKTPCLPNAKCEIRNGIBACYCNMGPSGNGVriCBDD

NECGNLTQSCGBNANCTNTEGSYYCMCVPGFRSSSNQDRPITNDGTVCIENVNANCHLDN 

VCIAANINKTLTKIRSIKEPVALLQEVYRNSVTDLSP 

I SAKDTLSNSTLTEKVKTVNNFVQROTFVVWDK^ 

FQKTTEFDTNSTDIALKVFFFDSYNMKHIHPHMNMDGD^ 
VYYKSIGPLI*SSSDNFLLKPQNYDNSBBEERVIS8VISVSMSSNPPTLYELEKITFTLSH

^RK\H?DRYRSLCAFWtTYSPDTMNGSWSSEGCELTYSNETHTSCRCiniLTHFAILMSSGPSI 

GIKDYNILTRITQLGIIISLICLAICIFTFWFFSEIQSTRTTIHKNLCCSLFLAELVFLV 

GItmmnaFCSIIAGLLHYFFUUFAWMCIEGlHLYLIWGVIYNKGFLHKN 

SPAVWGFSAAIiGYRYYGTTKVCWLSTBNNFIWSFIGPACLI ILVNLLAFGVI I YKVFRH 
TAGLKPEVSCFENIRS CARGALALLFLLGTTWI FGVLHVVHAS VVTAYLFTV6NAFQGMF

IFLFLCVLSRKIQEBYYRLFKNVPCCFGCIiSC 
SEQ ID NO: 95

gi|18875377|ref|NM_133222.1| Mus musculus EGF, latrophilin seven transmembrane domain
containing 1 (Eltd1), mRNA

1 acgcggggac aggggacaga gcgagaccct gcggtgcctg cctagctcca ggacagaccc 
61 gcactctagc ctcgagcgct ctgccgccag ctccgcggct tccaatgaga ctcctcccgc
121 ttctagtggg tttctccact ttgctgaatt gttcctacac acaaaactgc agcaagacaa 
181 cgtgtctccc caatgccaag tgcgaagtgc acaatggtgt ggaagcctgc ttctgcagcc 
241 aggggtactc tgggaatggt gtcacgattt gtgaagatat agatgagtgc agcgagtctt
301 ctgtctgcgg cgatcatgct gtgtgtgaaa acgcgaacgg gggcttcagc tgcttctgca 
361 gggaaggtta tcagaccgcc acggggaagt cacagttcac acctaatgat ggctcttact
421 gccaagatat agatgagtgc agcgagtctt ctgtctgcgg tgatcatgct gtgtgtgaaa
481 acgtgaacgg gggcttcagc tgcttctgca gggaaggtta tcagaccgcc acggggaagt
541 cacagttcac acctaatgat ggctcttact gccaagaaag catgaattca aattgccact
601 tagagcatgc ctgcatcgct gcaaacatta ataaaacttt aaaaagaatt ggacccataa
661 cagaacagac aactttactc caagaaatct acagaaattc tgaggctgag ctctctctga
721 tggatatagt cacatacata gagatcctaa ctgaatcatc ctcactacta ggccacccga 
781 acagcaccac ttcatacaag gatgcccact tcaactcaac tcttactgaa tttggggaaa 
841 ccatcaataa ttttgttgaa aggagtacac ataaaatgtg ggaccagtta ccgacaaatc 
901 acagaagact tcatctcaca aaactgatgc acactgctga gctagtcacc ttacagatcg
961 ctcagaacat ccagaagaat tctcagtttg atatgaattc tactgacttg gctctcaagg
1021 tttttgcttt tgattcaact cacatgaagc atgctcaccc ccacatgaat gtggatggag 
1081 gctatgtgaa aatatcccca aggagaaagg ctgcacatgg cacaactggc aatgtagtag 
1141 ttgcattcct ctgctataag agcattggtc ccttgctatc ctcatctgac aacttcttac
1201 tggacactca aaatgataat tctgaaggaa aggaaaaagt catttcttca gtgatttctg 
1261 cctcaattag ctcaaatcca cccacattat atgaacttga aaaaattaca tttacactaa
1321 gtcatgtaaa gctctcagat aagcaccgga cccagtgtgc cttttggaac tactcagttg 
1381 atgccatgaa caatggcarc tggtcaacgg agggctgtga gctgacacac tcaaacgaca
1441 cccacacctc ctgccgctgt agtcacctga cacactttgc gattttgatg tcctctactt 
1501 cttccattgg gattaaggat tataatatcc tgacgaggat cactcaactc gggataatca 
1561 tctccctgat ctgcctcgcc atctgcatct tcaccttctg gttcttcagt gaaatccaaa
1621 gcaccaggac cacgattcac aagaacctgt gctgcagcct ctttcttgca gaacttgttt
1681 ttcttattgg gatcaacata aatacgaata agttggtctg ctctatcatt gctggcctgc 
1741 tccattactt cttcttagct gcctttgcct ggatgtgcat cgaaggcatt cacctatacc 
1801 tcatcgttgt cggcgtcatc tacaacaagg ggtttttaca caagaacttt tatatctttg
1861 gctatctcag cccagctgta gttgttggat tctcagcatc tttaggatac agatattatg
1921 gaaccacgaa agtatgttgg ctgagcactg aaaacaactt catttggagc tttataggac 
1981 cagcgtgtct aatcattctt gtgaatctct tggcttttgg agttatcata tacaaagttt 
2041 tccgccacac tgctggactg aaaccagaag ttagttgcta tgagaacata aggtcttgtg 
2101 ccagaggtgc cctagccctc ctcttccttc tgggtaccac ctggatcttt ggggttctcc 
2161 atgtagtgca tgcatctgtt gtgacagcct acctcttcac agtcagcaat gctttccaag
2221 ggatgttcat tttcttattc ctatgcgttt tgtctagaaa gattcaagag gaatattata 
2281 gattgttcaa aaatgtcccc tgctgctttg gatgtttaag ataaacaatg agaagtcatg 
2341 ataattacag ctgcaatgag atgaaaattc caagattcag ataacctgtg tggcaaaaaa
2401 tgaagcctgt ttttattgtt agtaattaat ttcaaatcca tttttctgtt cacagtataa 
2461 gagatgtagt taatgtgaga taaaattatg gaccagagag ctacagtgtg ttttcttaca
2521 tgacatagtt agagatatgt caaaaatagt actgcagata tttggaaagt aattggtttc
2581 tctggagtga tatcactgtg cccaaggaaa gatttctttc taacacgaga aatatatgaa 
2641 tgtcctcaag gaaaccactg gctcgatatc tttgtgactc atgttgcctt tcaaacgagt 
2701 tccctaccac cttagtaatg agttcctttg caggaaggag agcataagag acgtggaggg 
2761 gcagagtatg aagcagtgac gaaggcttct ctgacaagga attgtcattc caataaactc
2821 agcttctcta aacttgatga gaaaatctca agataaaata acgagaaagg aaatatatcc
2881 tagcagtttg ggaattggtc tgaagtaaaa agccccagat ctaaatttgc tacatccatg
2941 ttcttcctta ctcttctaaa accagagaaa agccttacaa ctgacattat cagagatgga
3001 tgctcttaca ctaacattag atttgagtgt aaaatgtttt cattccacac agattaagac 
3061 ttcaaatatg tagtcagtaa aacatagatt tgtcaaagta taatactgtt tatgtcttta
3121 gtgaaaagaa tgtgtgcagt attttgtcta taatatttta ctgttatgaa aattaccttt 
3181 taatattaaa tcagtatact tgaatacttt gctgttttta atcttacaaa tagtgtaatt 
3241 catgttgcaa caggcccttt taattgactg tacataaaag gtcattataa atttaaagta 
3301 ttgatgaagt gaattataat tcttttctga tcagaaaata cacaattaaa gcattattta
3361 taacaaataa gaagtcactg agtgctgtag gggtttcaca gtgggtctag ttttagactg 
3421 tttctactat ctctcaragt ctattggctc aaatgtatgg ctctatctat tctcttgttc 
3481 aaatgaagag gcagattttt tttcagaagt gagtcattgt tctgaacctt cstgaacagc 
3541 ataattcaat ctactggaca ttgcaatttt aattcttgtg ctgttgaatg aagcctgtcg
3601 agacctctcc tgaaaaatga acagtcagct ggatgaagca gccctatcgc tgcctgaccg 
3661 agttgttctc tcaggagaga ccactcacct gtcaagaagg gctttgcatt tctagagcct 
3721 gtgagatggt acactttgac taaatctctg gatttcttct gtgctaagtc tgtggcccat 
3781 gactgccatt gtcattctgg gttgggactg tagaaatagg atatcaaaac ctagtctgct 
3841 caatcagtgg atatgaaact attgcacgtg gtagacagag tgaccctcca caaatatctg
3901 tacgctcctc ttcagagcct gtgbctgtgg tggatggcga ggagcaggag ggccctgtgk 
3961 ggagggaggg tagctgacct ccactcagtt ggagcactct cactaccctg agggaagtca 
4021 gcatgctgct gatgtatttc agtgtgggtc tcctggtttg gaactctcat ttctagagct 
4081 acaagacaat aaaattctat tatcaaagcc aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 


SEQ ID NO: 96

Amino acid sequence of mouse ETL encoded by the DNA sequence shown in SEQ ID NO:
95.

MRLLPLLVGPSTLIjNCSyTQNCSKTTCLPNAKCEVHNQVEACFCSQGYSGNGVTICEDID 
ECSESSVCGDHAVCENVNGGFSCPCREGYQTATGKSQFTPNDGSYCQDIDECSESSVCGD
HAVCENVNGGFSCFCl^QYQTATGKSQFTPNDGSYCQESMNSNCHIiEHACIAANIKKTLK 
RIGPITEQTTLLQEIYRNSEAELSLMDIVTYIEILTESSSL^ 

TEFGETINNFVERSTHKMWDQIiPTiraRRLHLTKLMHTAELVTLQIAQNIQKNSQFDMNST 
DLALKVF AFDSTHMKHAHPHMNVDGG YVKI S PRRKAAHGTTGNWVAFLCYKS IGPLLSS 

SDNFXiliDTQNDNSEGKEKVISSVISASISSNPPTLYELEKITFTLSKVKLSDKHRTQCAP
VWYSVDAMNNGXWSTEGCELTHSNDTHTSCRCSHLTHFAILMSSTSSIGIKDYNILTRIT 
QLG 1 1 1 SL I CLAI C I FTFWFFSE IQSTRTTIHKNLCCSIjFLAELVFL IG IN INTOKLVCS 
1 1 AGLLH Y FFLAAFAWMC1 EGIHLYL I VVGVI YNKGFLHKNFY I FGYLS PAVVVGFSASL 
GYRYYGTTKVCWLSTENNF I WSF IGPACL 1 1 L VNLLAFG VI I YKVFRHTAGLKPEVS CYE 

NIRSCARGAWILLFLLGTTWIFGVLHVVHASVVT
QEEYYRLFKNVPCCFGCLR 

SEQ ID NO: 97 

gi|11560110|ref|NM_022294.1| Rattus norvegicus ETL protein (Etl), mRNA

1 ccacaggctg agactagagt ccaggctgtt tgggtgaagg ggcctggcgg ccggacgtgg 

61 cctgcagagt ctgggctgtg cacacattca cacaaaagag gccgggaagt gacaggagga
121 agctgtgcgt cacaagggac tgagcgggac cctgccgcgc ctgcccagct ccaggacaga 
181 ccccaactct tgccttcagc gctctgcgga gccagccagc tccacccggc ttccaatgag 
241 actcctcctg cttctagtgg gtctctccac tttgctgaat cactcctaca cacaaaactg 
301 caagacaccg tgtctcccaa atgccaagtg tgaggtgttg gacgaagtgg cagcctgctt 

361 ctgcagtaca ggctacactg ggaatggcat cacgatttgt gaagatgtag acgagtgcaa
421 cgagacctcc gtctgcggtg atcacgctgt gtgtgaaaac acgaatggag gatttagctg 
481 cttctgcgtg gaaggttatc agacctccac cgggaagacg cagttcacgc ctaatgatgg
541 ctcttactgc caagatgtag acgagtgcaa cgagacctcc gtctgcggtg atcacgctgt 
601 gtgtgaaaac acgaacggag gatttagctg cttctgcgtg gaaggttatc agacctccac 

661 cgggaagacg cagttcacgc ctaatgatgg ctcttactgc caagaaattg tgaattcaaa
721 ttgccactta gagcatgact gcattgctgc aaacattaat aaaactctaa aaagaattgg
781 acccataaca gaacagctga ctttactcca tgaaatctac aagaattctg aggctgagct 
841 ttctctggtg gatatagtca catacataga gatactaaca gaatcatcct cactacaagg 
901 ctacataaag aacaccactt cgcccaagga tgcctacttc ggttcagctc ttactgaatt 

961 tggaaaaacc gccaataatt ttgttgaaaa gaacacacat gaaatgtggg accagttacc
1021 tacaaatcgt agaagactcc atctcacaaa actgatgcac gctgctgagc acgtcacctt
1081 acagatctct cagaacatcc agaagaatac tcagtttgac atgaattcta ccgacttggc 
1141 tctcaaggtt ttcgtttttg attcagttca catgaagcat actcatcccc atatgaatgt 
1201 ggacggaggc tatgtaaaaa tatccccgag gagaaaatct gcatatgacc caaatggcaa
1261 cgtcattgtt gcattcctgt gctataggag cattggcccc ttgctttcct catctgacga 
1321 cttcttactg ggcgctcaga gtgacaattc caaaggaaag gagaaggtca tttcttcagt 
1381 gatttctgcc tcaattagct caaacccacc cacactgtat gaacttgaaa aaattacatt 
1441 tacactgagt catgtaaagc tctcagataa gcaccagaca cagtgcgcct tttggaacta 
1501 ctcagtcgat gacatgaaca atggcagctg gtcatctgag ggctgtgagc tgacatactc 
1561 caacgacacc catacttcct gccgatgtag tcatctgaca cactttgcga ttttgatgtc 
1621 ccccagtacc tccattgaag ttaaagatta caatatcctg acgaggatca ctcagctggg 
1681 aataatcatc tccctgatct gcctcgccat atgcattttc accttctggt tcttcagtga 

1741 gattcaaagc accaggacca caatccacaa gaatctctgc tgcagcctct ttcttgcaca 
1801 actagttttt cttgtcggca tcaacataaa cacaaacaag ctggtctgct ctatcatcgc 
1861 tggcctgctc cattacttct tcttagctgc ctttgcctgg atgtgcattg aaggcatcta 
1921 cctatacctc atcgttgttg ggctcatcta taacaagggg tttttacaca agaacttcta 
1981 tatctttggc tatcttagcc cggctgtagt tgttggattc tcggcctctt tgggatacag 

2041 atattatggt accaccaaag tatgttggtt gagcactgaa aacaacttta tctggagctt 
2101 catagggcca gcgtgtctaa tcattcttgt taatctcttg gcttttggag ttatcatata 
2161 caaagtgttc cgccacactg ctggactgaa gccagaagtt agttgctacg agaacataag 
2221 gtcttgcgcc agaggagccc tggccctcct cttccttctg ggtaccacct ggacctttgg 
2281 ggttctccac gtagtgcatg catctgttgt gacagcctac ctcttcacag tcagcaacgc 

2341 tttccaaggg atgtttattt tcttattcct atgtgtttta tctagaaaga ttcaagaaga 
2401 atattacaga ttgttcaaaa atgtcccctg ctgttttgaa tgtttaagat aaacaacgag 
2461 aagacacaat aattatagct gaaatgaaat ggaaattcca agatttcgga tagcctgtgt 
2521 gacaaaaatg agcctgcctt cattgttagt aattaatttc aaattcgctt ttctgttcgc 
2581 agtataaaag atgtagttaa tgtgagataa aattatgggc cagagagctc ctgtgtgttt 

2641 tcctacatga catagttaga tatgtcaaaa atagtactgc agatatttgg aaagtaattg 
2701 gtttctctgg agtgatatca ctgtgcccaa ggaaagattt ctttctaaca caagaaatag 
2761 atgaatgtcc tcaaggaagc gactggcttg atatctttgt gactcatgtt gcctttcaaa 
2821 cgagtcccct accaccatag taatgagttc ctttgcagaa aggagagtat aagaaacttg 
2881 gaggggcaga atatgaagca atggagaagc cttctctgac aaggaattgt cattccaata 

2941 aaattggctt tctccaaaat tgaagaggaa aaaattttca ggctaaaata acgaaaaagg 
3001 aaatgcatcc tagcactttg ggaattggtc tgaacttaaa aggcccagac ctaaatttac 
3061 tacatccatg ttcttcctta ctgttctaaa ccaaagaaaa accttaaaat ttacagatac 
3121 atggatgagt gttctcacat aacatcatat ttgaatgtaa atttttttca ttcctcacag 
3181 attaagactt cagcaacata tttggtaaaa cataaatttg tcaaactata agactgttca 

3241 tatctttagt gaaaaaatag aatgtgaagt attttgtcta taatatttta ctgttatgaa 
3301 aataatcttt tcatattaga gcagtatact tgaatacttt actgttttta atcttacaaa 
3361 tagtgtgatt catgttgcaa ccagcccttt taattgactg tatttaaaag ggcattataa 
3421 atttaaacta ttgatgaagt aaattataat ggttttctga tcagaaaata catacttaaa 
3481 gcattattta taacaaataa aaagtcactg agcactgcag gggtttcaca gtggatctga 

3541 tatttttaga ccgtttccta tcacctatca gtctatttac ttaaatgtac agctctacca 
3601 attctcttac tcaaaggaag aggcagtatt tttctcagaa gtgagtcatt gttctgtacc 
3661 ttcctggaga catgattcga tccattgaac attgtggttt taattcttgt gctgttgaat 
3721 gaagcctgac aagacacctc ctaaaaaatg aaatgtcagc tggatgaagc agccctgcta 
3781 ctgcctgact gagttgttct ctcaggaaag accactcacc tgccaagaag cacgttgcat 
3841 ctctacagat ctcagggttt ctcccatgcc aagtctgtag cccacgagca tcattgtcat 
3901 tctaagatgg gactgtagaa ataggatatc aaaacataat ccgttcaatc aatggataag 
3961 aaactatcac atgtagtaga cagaataacc cttctcaaat attcatacac tcctcttcac 
4021 aagctgtggc cgtggtggat agtgaggagc aggaggtcct gtcaggagga agagtagctg 
4081 aggtccactc agttggagaa ggctctcact gtgctggggg aagtcagcat gctgacgatg 

4141 ttactttagt ttgggtctct tgttttggac atctcatttc tagagctgta aagacaataa 
4201 aattctatta tcaaagccaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
4261 aaaaaaaaaa aaaa 

SEQ ID NO: 98 

Amino acid sequence of rat ETL encoded by the DNA sequence shown in SEQ ID NO: 97. 

MRLLLLLVGLSTLLNHSYTQNCKTPCLPNA 

CNETSVCGDHAVCENTNGGFSCFCVEGYQTSTGKTQFTPNDGSYCQDVDECNETSVCGDH 
AVCENTNGGFSCFCVEGYQTSTGKTQFTPNDGSYCQEI 
IGPITEQLTLLHEIYKNSEAELSLVDIVTYIEILTESSSLQGYIKNTTSPKDAYFGSALT 

EFGKTANNFVEKNTHEMWDQLPTNRRRLHLTKLMHAAEHVTLQISQN 

LALKVFVFDSVHMKHTHPHMNVDGGYV 

DDFLLGAQSDNSKGKEKVISSVISASISSNPPTLYELEKITFTLSHVKLSDKHQTQCAFW 

NYSVDDMNNGSWSSEGCELTYSNDTHTSCRCSHLTHFAILMSPSTSIEVKDYNILTRITQ 

LGIIISLICLAICIFTFWFFSEIQSTRTTIHKNLCCSLFLAQLVFLVGININTNKLVCSI 

IAGLLHYFFLAAFAWMCIEGIYLYLIVVGLIYNKGFLHKNFYIFGYLSPAVVVGFSASLG 

YRYYGTTKVCWLSTENNFIWSFIGPACLIILVNLLAFGVIIYKVFRHTAGLKPEVSCYEN 

IRSCARGALALLFLLGTTWTFGVLHVVHASV 

EEYYRLFKNVPCCFECLR 

SEQ ID NO: 99 

gi|31982926|ref|NM_023928.2| Homo sapiens acetoacetyl-CoA synthetase (AACS), mRNA 

1 cgctgaccca gcccgccagg cgctcctgac cgtcgcttcc tccggtccca ggtccccggc 
61 cctcgcctca gccccggccc ctggtcccca gccctcgtcg cagccccggc cgcccgccgc 
121 cgccatgtcc aaggaggagc gccccggtcg ggaggagatc ctggagtgcc aggtgatgtg 
181 ggagcctgac agtaagaaga acacgcagat ggaccgcttc cgggcggctg tgggcgccgc 
241 ctgcggcctg gcgctggaga gttatgatga cttgtaccat tggtccgttg agtcatattc 
301 agacttctgg gcagagttct ggaaattcag tggaattgtc ttctcacgtg tgtatgatga 
361 ggttgtggac acatcgaaag gaatcgcaga tgtccccgag tggttcaaag gcagtcggct 
421 caactatgca gaaaacctcc tgcggcacaa agagaatgac agagttgccc tttacattgc 
481 aagggaaggc aaagaggaaa ttgtgaaggt gacttttgaa gagctgaggc aagaagtggc 
541 tttgtttgca gcagcaatga ggaaaatggg tgtgaagaaa ggagatcggg ttgttggtta 
601 tttacccaac agtgagcacg ctgtcgaggc gatgctggct gcggcaagca ttggtgccat 
661 ctggagctcc acgtccccgg acttcggtgt gaatggtgtg ctggaccggt tttctcaaat 
721 tcagccaaag ctcatcttct ctgtggaggc tgttgtctat aatggcaaag agcacaacca 
781 catggaaaag ctgcagcagg tggttaaagg cctaccagac ttgaagaaag tggtggtgat 
841 tccttatgtg tcctccagag agaacataga cctttcaaag attccaaaca gtgtgtttct 
901 ggatgacttt cttgccaccg gcaccagtga gcaggccccg cagctggagt tcgagcagct 
961 gcccttcagc cacccactgt tcatcatgtt ctcatcgggc accacgggcg cacccaagtg 
1021 catggtgcat tccgctgggg gcaccctcat ccagcatctg aaggagcacc tgctgcacgg 
1081 caacatgacc agcagtgaca tcctcctgtg ctacaccacg gtcggctgga tgatgtggaa 
1141 ctggatggtg tcccttctgg ccacaggagc ggccatggtc ttgtacgatg gctcccccct 
1201 ggtgcccacg cccaatgtgc tctgggacct ggttgacagg ataggcatca ctgtcctggt 
1261 aactggggcc aagtggctgt cagtgctgga agagaaggcc atgaagccgg tggaaaccca 
1321 cagtctccag atgctccaca cgatcctgtc cactggctcc ccactgaaag cccagagcta 
1381 cgagtatgtc tacaggtgca tcaagagcag catcctcctg ggctccatct caggaggcac 
1441 cgacatcatc tcctgcttca tgggccacaa tttttctctt cctgtgtata aaggggagat 
1501 tcaggcccgg aacctgggca tggccgtgga agcgtggaac gaggaaggaa aggcggtctg 
1561 gggagagagc ggcgagctgg tgtgtactaa gccgatccct tgccagccca cacacttctg 
1621 gaacgatgag aacggcaaca agtacaggaa ggcgtatttc tccaaattcc caggtatctg 
1681 ggctcatggc gactactgca gaatcaaccc caagaccggg ggcatcgtca tgcttggccg 
1741 gagtgacggc accctcaacc ccaacggggt gcggttcggc agctcggaaa tctataacat 
1801 tgtggaatcc ttcgaggagg tggaggacag cctgtgtgtc ccccagtata acaagtacag 
1861 ggaggagagg gtgatcctct tcctgaagat ggcctccggg cacgccttcc agcctgactt 
1921 ggttaagagg atccgtgacg ccatccgcat gggcttgtct gcgcgacacg tgcccagcct 
1981 catcctggaa accaagggca tcccgtatac gctcaacggc aagaaagtgg aagttgccgt 
2041 caaacagatc atcgctggaa aagccgtgga gcaaggaggt gctttctcga accccgagac 
2101 cctggatctg taccgggaca tccctgagct gcagggcttc tgagtcagac tggctggcgt 
2161 gtcactcagc cgcacccgtg tgcactgtaa cttttgtgtg ctcaagaaat tatacagaaa 
2221 cctacagctg ttgtaaaagg atgctcgcac caagtgttct gtaggcttgg ggagggatcg 
2281 tttctctgtt ttgttaaatc tggtgggtac ctggatcttc cacacgagtg ggattctggc 
2341 cttcagagac caggagggag tgtctgggcc gcaggtgtgg cactgtggtg agagtgtgtg 
2401 tctttgcaca cacagtgcag tgggaacggt ggggctggct ggtgctgaag acagacacac 
2461 tcctgagcca aggtcttgtc ttcaacctcc ccgtcccgtt gtcccatttt gctctgtgaa 
2521 ggtgcaaatc cctttcttcc cttcccatct caggctctcc tgttttccct cagggtccag 
2581 tatgcctttg agctttagct gttagaaagg aacccccgtg acttgacaca gctttcacag 
2641 ctggctgcta ggaccggcgg gctgggtgtt cacgtgtgtc tgtgtcatgg atgcaatgca 
2701 ggccctggag gactgtgcgt cacccgtcaa ccagagcgtg cctccgggcc agcttccctc 
2761 caaggaatga gtggatttca tacaggatct ctttattgca cagactgaat ggctttacat 
2821 gtttctaatg tgaattaggc atgtgaagca gtgggtgtcc acccgtgtcc ctcatgggtg 
2881 agccctccag ctgtgagccc aggcagtgtg gtcaccgagt gaggaccctc ctcaccagga 
2941 accgcatccc tgtgctgcct ccacctgaga gttgctaggg ggttcttgtc gagatcatgt 
3001 catcagcacc cctaagtcaa gtcacgggtt tccatagcca ggcagttggt atgtacaatt 
3061 cagttcagcg tatgaacttg tatctctaat ctgatgtcca tttttatatt ttttgaaact 
3121 gagcacaatg aaatcctttc ttgaatcatt ttccttttgg attataaaaa tatgggggaa 
3181 agtgctatga tgagttttat gcaataaatg tatacatgtg tgcacatgca cccaaaaaaa 
3241 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
3301 aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 

SEQ ID NO: 100 

Amino acid sequence of human FLJ12389 encoded by the DNA sequence shown in SEQ ID 
NO: 99. 

MSKEERPGREEILECQVMWEPDSKKNTQMDRFRA 

FWAEFWKFSGIVFSRVYDEVVDTSKGIADVPEWFKGSRLNYAENLLRHKENDRVALYIAR 
EGKEEIVKVTFEELRQEVALFAAAMRKMGVKKGDRVVGYLPNSEHAVEAMLAAASIGAIW 
SSTSPDFGVNGVLDRFSQIQPKLIFSVEAVVYNGKEHNHM 

YVSSRENIDLSKIPNSVFLDDFLATGTSEQAPQLEFEQLPFSHPLFIMFSSGTTGAPKCM 
VHSAGGTLIQHLKEHLLHGNMTSSDILLCYTTVGWMMWNWMVSLLATGAAMVLYDGSPLV 
PTPNVLWDLVDRIGITVLVTGAKWLSVLEEKAMKPVETHSLQMLHTILSTGSPLKAQSYE 
YVYRCIKSSILLGSISGGTDIISCFMGHNFSLPVYKGEIQARNLGMAVEAWNEEGKAVWG 
ESGELVCTKPIPCQPTHFWNDENGNKYRKAYFSKFPGIWAHGDYCRINPKTGGIVMLGRS 
DGTLNPNGVRFGSSEIYNIVESFEEVEDSLCVPQYNKYREERVILFLKMASGHAFQPDLV 
KRIRDAIRMGLSARHVPSLILETKGIPYTLNGKKVEVAVKQIIAGKAVEQGGAFSNPETL 
DLYRDIPELQGF 

SEQ ID NO: 101 

gi|10436299|dbj|AK024036.1| Homo sapiens cDNA FLJ13974 fis, clone Y79AA1001581, 
weakly similar to ACETYL-COENZYME A SYNTHETASE (EC 6.2.1.1) 

1 cccttgttcc ccgccgccgc cgtcgctgac ccagcccgcc aggcgctcct gaccgtcgct 

61 tcctccggtc ccaggtcccc ggccctcgcc tcagccccgg cccctggtcc ccagccctcg 
121 tcgcagcccc ggccgtccgc cgccgccatg tccaaggagg agcgccccgg tcgggaggag 
181 atcctggagt gccaggtgat gtgggagcct gacagtaaga agaacacgca gatggaccgc 
241 ttccgggcgg ctgtgggcgc cgcctgcggc ctggcgctgg agagttatga tgacttgtac 
301 cattggtccg ttgagtcata ttcagacttc tgggcagagt tctggaaatt cagtgaaatt 
361 gtcttctcac gtgtgtatga tgaggttgtg gacacatcga aaggaatcgc agatgtcccc 
421 gagtggttca aaggcagtcg gctcaactat gcagaaaacc tcctgcggca caaagagaat 
481 gacagagttg ccctttacat tgcaagggaa ggcaaagagg aaattgtgga ggtgactttt 
541 gaagagctga ggcaagaagt ggctttgttt gcagcagcaa tgaggaaaat gggtgtgaag 
601 aaaggagatc gggttgttgg ttatttaccc aacagtgagc acgctgtcga ggcgatgctg 
661 gctgcggcaa gcattggtgc catctggagc tccacgtccc cggacttcgg tgtgaatggt 
721 gtgctggacc ggttttctca aattcagcca aagctcatct tctctgtgga ggctgttgtc 
781 tataatggca aagagcacaa ccacatggaa aagctgcagc aggtggttaa aggcctacca 
841 gacttgaaga aagtggtggt gattccttat gtgtcctcca gagagaacat agacctttca 
901 aagattccaa acagtgtgtt tctggatgac tttcttgcca ccggcaccag tgagcaggcc 
961 ccgcagctgg agttcgagca gctgcccttc agccacccac tgttcatcat gttctcatcg 
1021 ggcaccacgg gcgcacccaa gtgcatggtg cattccgctg ggggcaccct catccagcat 
1081 ctgaaggagc acctgctgca cggcaacatg accagcagtg acatcctcct gtgctacacc 
1141 acggtcggct ggatgatgtg gaactggatg gtgtcccttc tggccacagg agcggccatg 
1201 gtcttgtacg atggctcccc cctggtgccc acgcccaatg tgctctggga cctggttgac 
1261 aggataggca tcactgtcct ggtaactggg gccaagtggc tgtcagtgct ggaagagaag 
1321 gccatgaagc cggtggaaac ccacagtctc cagatgctcc acacgatcct gtccactggc 
1381 tccccactga aagcccagag ctacgagtat gtctacaggt gcatcaagag cagcatcctc 
1441 ctgggctcca tctcaggagg caccgacatc atctcctgct tcatgggcca caatttttct 
1501 cttcctgtgt ataaagggga gattcaggcc cggaacctgg gcatggccgt ggaagcgtgg 
1561 aacgaggaag gaaaggcggt ctggggagag agcggcgagc tggtgtgtac taagccgatc 
1621 ccttgccagc ccacacactt ctggaacgat gagaacggca acaagtacag gaaggcgtat 
1681 ttctccaaat tcccaggtat tgggctcatg gcgactactg cagaatcaac cccaagaccg 
1741 ggggcatcgt catgcttggc cggagtgacg gcaccctcaa ccccaacggg gtgcggttcg 
1801 gcagctcgga aatctataac attgtggaat ccttcgagga ggtggaggac agcctgtgtg 
1861 tcccccagta taacaagtac agggaggaga gggtgatcct cttcctgaag atggcctccg 

1921 ggcacgcctt ccagcctgac ttggttaaga ggatccgtga cgccatccgc atgggcttgt 
1981 ctgcgcgaca cgtgcccagc ctcatcctgg aaaccaaggg catcccgtat acgctcaacg 
2041 gcaagaaagt ggaagttgcc gtcaaacaga tcatcgctgg aaaagccgtg gagcaaggag 
2101 gtgctttctc gaaccccgag accctggatc tgtaccggga catccctgag ctgcagggct 
2161 tctgagtcag actggctggc gtgtcactca gccgcacccg tgtgcactgt aacttttgtg 

2221 tgctcaagaa attatacaga aacctacagc tgttgtaaaa ggatgctcgc accaagtgtt 
2281 ctgtaggctt ggggagggat cgtttctctg ttttgttaaa tctggtgggt acctggatct 
2341 tccacacgag tgggattctg gccttcagag accaggaggg agtgtctggg ccgcaggtgt 
2401 ggcactgtgg tgagagtgtg tgtctttgca cacacagtgc agcgggaacg gtggggctgg 
2461 ctggtgctga agacagacac actcctgagc caaggtcttg tcttcaacct ccccgtcccg 

2521 ttgtcccatt ttgctctgtg aaggtgcaaa tccctttctt cccttcccat ctcaggctct 
2581 cctgttttcc ctcagggtcc agtatgcctt tgagctttag ctgttagaaa ggaacccccg 
2641 tgacttgaca cagctttcac agctggctgc taggaccggc gggctgggtg ttcacgtgtg 
2701 tctgtgtcat ggatgcaatg caggccctgg aggactgtgc gtcacccgtc aaccagagcg 
2761 tgcctccggg ccagcttccc tccaaggaat gggtggattt catacaggat ctctttattg 

2821 cacagactga atggctttac atgtttctaa tgtgaattag gcatgtgaag cagtgggtgt 
2881 ccacccgtgt ccctcatggg tgagccctcc agctgtgagc ccaggcagtg tggtcaccga 
2941 gtgaggaccc tcctcaccag gaaccgcatc cctgtgctgc ctccacctga gagttgctag 
3001 ggggttcttg tcgagatcat gtcatcagca cccctaagtc aagtcacggg tttccatagc 
3061 caggcagttg gtatgtacaa ttcagttcag cgtatgaact tgtatctcta atctgatgtc 
3121 catttttata ttttttgaaa ctgagcacaa tgaaatcctt tcttgaatca ttttcctttt 
3181 ggattataaa aatatggggg aaagtgctat gatgaatttt atgcaataaa tgtatacatg 
3241 tgtgcacatg c 

SEQ ID NO: 102 

Amino acid sequence of human FLJ12389 variant ORF number 1 encoded by the DNA 
sequence shown in SEQ ID NO: 101. 

MSKEERPGREEILECQVMWEPDSKKNTQMDRFRAAVGAACGLALESYDDLYHWSVESYSD 
FWAEFWKFSEIVFSRVYDEVVDTSKGIADVPEWFKGSRLNYAENLLRHKENDRVALYIAR 
EGKEEIVEVTFEELRQEVALFAAAMRKMGVKKGDRVVGYLPNSEHAVEAMLAAASIGAIW 
SSTSPDFGVNGVLDRFSQIQPKLIFSVEAVVYNGKEHNHMEKLQQVVKGLPDLKKVVV 

YVSSRENIDLSKIPNSVFLDDFLATGTSEQAPQLEFEQLPFSHPLFIMFSSGTTGAPKCM 
VHSAGGTLIQHLKEHLLHGNMTSSDILLCYTTVGWMMWNWMVSLLATGAAMVLYDGSPLV 
PTPNVLWDLVDRIGITVLVTGAKWLSVLEEKAMKPVETHSLQMLHTILSTG 
YVYRCIKSSILLGSISGGTDIISCFMGHNFSLPVYKGEIQARNLGMAVEAWNEEGKAVWG 
ESGELVCTKPIPCQPTHFWNDENGNKYRKAYFSKFPGIGLMATTAESTPRPGASSCLAGV 

TAPSTPTGCGSAARKSITLWNPSRRWRTACVSPSITSTGRRG 

SEQ ID NO: 103 

gi|10434316|dbj|AK022740.1| Homo sapiens cDNA FLJ12678 fis, clone NT2RM4002409, 
weakly similar to ACETYL-COENZYME A SYNTHETASE (EC 6.2.1.1) 

1 agacacatcg aaaggaatcg cagatgtccc cgagtggttc aaaggcagtc ggctcaacta 
61 tgcagaaaac ctcctgcggc acaaagagaa tgacagagtt gccctttaca ttgcaaggga 
121 aggcaaagag gaaattgtga aggtgacttt tgaagagctg aggcaagaag tggctttgtt 
181 tgcagcagca atgaggaaaa tgggtgtgaa gaaaggagat cgggttgttg gttatttacc 
241 caacagtgag cacgctgtcg aggcgatgct ggctgcggca agcattggtg ccatctggag 
301 ctccacgtcc ccggacttcg gtgagaatgg tgtgctggac cggttttctc aaattcagcc 
361 aaagctcatc ttctctgtgg aggctgttgt ctataatggc aaagagcaca accacatgga 
421 aaagctgcag caggtggtta gaggcctacc agacttgaag aaagtggtgg tgattcctta 
481 tgtgtcctcc agagagaaca tagacctttc aaagattcca aacagtgtgt ttctggatga 
541 ctttcttgcc accggcacca gtgagcaggc cccgcagctg gagttcgagc agctgccctt 
601 cagccaccca ctgttcatca tgttctcatc gggcaccacg ggcgcaccca agtgcatggt 
661 gcattccgct gggggcaccc tcatccagca tctgaaggag cacctgctgc acggcaacat 
721 gaccagcagt gacatcctcc tgtgctacac cacggtcggc tggatgatgt ggaactggat 
781 ggtgtccctt ctggccacag gagcggccat ggtcttgtac gatggctccc ccctggtgcc 
841 cacgcccaat gtgctctggg acctggttga caggataggg tatctgggct catggcgact 
901 actgcagaat caaccccaag accgggggca tcgtcatgct tggccggagt gacggcaccc 
961 tcaaccccaa cggggtgcgg ttcggcagct cggaaatcta taacattgtg gaatccttcg 
1021 aggaggtgga ggacagcctg tgtgtccccc agtataacaa gtacagggag gagagggtga 
1081 tcctcttcct gaagatggcc tccgggcacg ccttccagcc tgacttggtt aagaggatcc 
1141 gtgacgccat ccgcatgggc ttgtctgcgc gacacgtgcc cagcctcatc ctggaaacca 
1201 agggcatccc gtatacgctc aacggcaaga aagtggaagt tgccgtcaaa cagatcatcg 
1261 ctggaaaagc cgtggagcaa ggaggtgctt tctcgaaccc cgagaccctg gatctgtacc 
1321 gggacatccc tgagctgcag ggcttctgag tcagactggc tggcgtgtca ctcagccgca 
1381 cccgtgtgca ctgtaacttt tgtgtgctca agaaattata cagaaaccta cagctgttgt 
1441 aaaaggatgc tcgcaccaag tgttctgtag gcttggggag ggatcgtttc tctgttttgt 
1501 taaatctggt gggtacctgg atcttccaca cgagtgggat tctggccttc agagaccagg 
1561 agggagtgtc tgggccgcag gtgtggcact gtggtgagag tgtgtgtctt tgcacacaca 
1621 gtgcagcggg aacggtgggg ctggctggtg ctgaagacag acacactcct gagccaaggt 
1681 cttgtcttca acctccccgt cccgttgtcc cattttgctc tgtgaaggtg caaatccctt 
1741 tcttcccttc ccatctcagg ctctcctgtt ttccctcagg gtccagtatg cctttgagct 
1801 ttagctgtta gaaaggaatg agtggatttc atacaggatc tctttattgc acagactgaa 
1861 tggctttaca tgtttctaat gtgaattagg catgtgaagc agtgggtgtc cacccgtgtc 
1921 cctcatgggt gagccctcca gctgtgagcc caggcagtgt ggtcaccgag tgaggaccct 
1981 cctcaccagg aaccgcatcc ctgtgctgcc tccacctgag agttgctagg gggttcttgt 
2041 cgagatcatg tcatcagcac ccctaagtca agtcacgggt ttccatagcc aggcagttgg 
2101 tatgtacaat tcagttcagc gtatgaactt gtatctctaa tctgatgtcc atttttatat 
2161 tttttgaaac tgagcacaat gaaatccttt cttgaatcat tttccttttg gattataaaa 
2221 atatggggga aagtgctatg atgaatttta tgcaataaat gtatacatgt gtgcacatgc 



SEQ ID NO: 104 

Amino acid sequence of human FLJ12389 variant ORF number 2 encoded by the DNA 
sequence shown in SEQ ID NO: 103. 

MRKMGVKKGDRVVGYLPNSEHAVEAMLAAASIGAIWSSTSPDFGENGVLDRFSQIQPKLI 
FSVEAVVYNGKEHNHMEKLQQVVRGLPDLKKVVVIPYVSSRENIDLSKIPN 

TGTSEQAPQLEFEQLPFSHPLFIMFSSGTTGAPKCMVHSAGGTLIQHLKEHLLHGNMTSS 

DILLCYTTVGWMMWNWMVSLLATGAAMVLYDGSPLVPTPNV 

QPQDRGHRHAWPE 

SEQ ID NO: 105 

gi|21313519|ref|NM_030210.1| Mus musculus acetoacetyl-CoA synthetase (Aacs), mRNA 

1 gagtctcgcg ctgtggttcg tcggcgcacc gctgatccgc tccacgcctt gcgctctccg 
61 ctctcagcca aagcccggca gccccggcca cgcagctccg caaccatgtc caagctggcg 
121 cggctcgagc gcgaggagat catggagtgc caggtgatgt gggagcctga cagcaagaag 
181 gacacgcaga tggaccgctt ccgggcggcc gtgggtaccg cctgcggcct ggcgcttggg 
241 aattacaatg acttatacca ctggtctgtc cggtcgtata tggacttttg ggctgagttc 
301 tggaagttca gtggaatcgt ctactcacgc atgtatgatg aggttgtgga cacatccaaa 
361 gggattgcag atgtccccga gtggttcaga ggcagccgcc tcaactacgc agagaacctc 
421 ctgcggcaca aggagaacga cagagtcgcc ctttacgtgg cccgggaagg cagagaggag 






481 atcgtgaagg tgacttttga agagctgcgg cagcaggtgg ctctgttcgc agctgccatg 
541 aggaagatgg gcgtgaagaa aggggaccgt gtggtcggct atctacccaa cagtgcgcat 
601 gcggtggagg ccatgctggc tgctgccagt attggggcca tttggagttc tacctcgccg 
661 gactttggtg taaatggtgt cctggaccgc ttttctcaaa ttcagccgaa acttatcttc 
721 tcggtggagg ctgttgtcta caatggcaag gagcatggtc acctggagaa gctgcagcga 
781 gttgtgaaag gactgcctga cctgcagcga gtggtgctga tcccctatgt cctcccaagg 
841 gagaagattg acatttccaa gatccccaac agtgtgttcc tggatgattt cctggcaagc 
901 gggaccggcg cgcaggcgcc gcagctcgag tttgagcagc tgcccttcag ccaccctctg 
961 ttcatcatgt tctcctcagg cacgacggga gcgcccaagt gcatggtgca ctcagccggg 
1021 ggcaccctca tccagcacct gaaggagcac atgctacacg gcaacatgac aagcagtgac 
1081 atcctgctct actacaccac ggtcggctgg atgatgtgga actggatggt gtcagccctg 
1141 gccacaggag catccttggt cctgtatgat ggctccccgc tggttccgac ccccaacgtg 
1201 ttgtgggacc ttgtggacag gatagggatc accatcctgg gaacgggagc caagtggctg 
1261 tcagtgctgg aggagaagga catgaagcca gtggaaactc acaacctcca cacgctgcac 
1321 acgatcctgt ccaccggctc gccgctgaaa gcccagagtt acgagtatgt gtacagatgc 
1381 atcaagagct ccgtgctcct gggctccatc tcaggaggca ctgacatcat ctcctgtttc 
1441 atgggccaga actcctctat tcctgtgtac aagggtgaga tccaagcccg gaaccttggc 
1501 atggctgtgg aagcctggga cgaggaaggg aaggccgtct ggggagcgag tggcgagctg 
1561 gtgtgcacta agcccattcc ctgccagccc acgcacttct ggaacgacga gaacggcagc 
1621 aagtaccgga aggcttactt ctccaaattc ccaggtgtct gggcacacgg tgactactgc 
1681 aggatcaacc ccaaaacagg aggcattatc atgctgggcc gtagtgatgg caccctcaac 
1741 cccaatggcg tccgctttgg cagctcggag atctacaaca tcgtggaagc cttcgatgag 
1801 gtggaggaca gcctgtgtgt accccagtac aacagagatg gcgaggagcg ggtggtcctg 
1861 ttcctgaaga tggcgtccgg gcacactttc cagcctgacc tcgtgaagcg catccgagac 
1921 gccatccgac ttggcctgtc tgcccgccat gtgcccagcc tcatcctgga gacccgaggc 
1981 attccataca cactcaatgg caagaaagtg gaggtggccg tgaagcaggt gatggctggg 
2041 aggactgtgg agcaccgggg ggccttctcc aaccccgaga ccctcgacct gtaccgggac 
2101 atccctgagc tgcaggactt ctgagccagc agctcgcact ccatccagct gtgcatgtga 
2161 tggaacttag ggacacttta gagacaacag ctgctccagg tggccctggc actgcacact 
2221 ccacaggctc aggaacagct ttcctctttc ggagtcattg gtgggggcca ggtctcccga 
2281 gctccaggat gtgcctggcc ttcggagacc ctaagcatac actgaggctt gtctcggccc 
2341 tgctgctata ggtcagcaca cagccttgca ggggcagggc tggtgacatt tggtgacagc 
2401 acactggagg agcgatgtgg ccttcagcct catgagcccg ctgttgtatc tcgctctatg 
2461 aaggtgaagt cttacctata ctctctgtcc ttaactggcc tggcttcctc agcgttactg 
2521 tcttctcaag agctgactaa agccagggca cctggcccag catgcctgtc acctcacagc 
2581 tggctgttgg gacacctggg ctgacggctc ctgtaccgtc taccaaggcc agaacaccat 
2641 tctagggaac ggtgagtgaa ttgtccagac agccaagagc tctctgctat ggcaagtccc 
2701 ctctgctgtc gggtgtcccc gactgtttct gggtcttacc agcacagtgt ttgaatgggt 
2761 caaacggttt aatgttaatt gagggcctgg gggcacttga atccctgggt gtctacccca 
2821 ggttaccact tagctgtgtg ctgggcagtg tgatcctgga gtccgccagg gatggagctg 
2881 ccccgcctca gagggaaatg gccccgtgct ccctcctaat agttcctggg ctctttctat 
2941 tgtgactgtc ctctacattt gtaaatgaag tctcaggctt tggctgccag gacttaatgg 
3001 tgcttctcct ctagcacacg gaatatgtca taaatcctgg tgattgcttt tgtatttttt 
3061 tttttaaaaa ctgagcacaa cataaaacct tttaaagata tcctggatct taagtctata 
3121 aaggaaaagt gctatgaaga attttatgga ataaatctgt gccatgcaca cctgg 



SEQ ID NO: 106 

Amino acid sequence of mouse FLJ12389 encoded by the DNA sequence shown in SEQ ID 
NO: 105. 

MSKLARLEREEIMECQVMWEPDSKKDTQMDRFRAAVG 
FWAEFWKFSGIVYSRMYDEVVDTSKGIADVPE 

EGREEIVKVTFEELRQQVALFAAAMRKMGVKKGDRVVGYLPNSAHAVEAMLAAASIGAIW 
SSTSPDFGVNGVLDRFSQIQPKLIFSVEAVVYNGKEHGHLEKLQRVVKGLPDLQRVVLIP 
YVLPREKIDISKIPNSVFLDDFLASGTGAQAPQLEFEQLPFSHPLFIMFSSGTTGAPKCM 
VHSAGGTLIQHLKEHMLHGNMTSSDILLYYTTV 

PTPNVLWDLVDRIGITILGTGAKWLSVLEEKDMKPVETHNLHTLHTILSTGSPLKAQSYE 
YVYRCIKSSVLLGSISGGTDIISCFMGQNSSIPVYKGEIQARNLGMAVEAWDEEGKAVWG 
ASGELVCTKPIPCQPTHFWNDENGSKYRKAYFSKFPGVWAHGDYCRINPKTGGIIMLGRS 
DGTLNPNGVRFGSSEIYNIVEAFDEVEDSLCVP 

KRIRDAIRLGLSARHVPSLILETRGIPYTLNGKKVEVAVKQVMAGRTVEHRGAFSNPETL 
DLYRDIPELQDF 

SEQ ID NO: 107 

gi|12831226|ref|NM_023104.1| Rattus norvegicus acetoacetyl-CoA synthetase (LOC65984), 
mRNA 



1 tctcgcgctg tggttcgtcc gcgccacgct gagccgctcc acgccttgcg ctctccgctg 
61 tctccgcagc taaagcccgg gcagccccgg ccacgcagct ccgcaaccat gtccaagctg 
121 gcacggctcg agcgcgagga gatcatggag tgccaggtga tgtgggagcc tgacagcaag 
181 aaggacacgc agatggaccg cttccgggcg gccgtgggta ctgcctgcgg cctggcgctt 
241 gggaattacg atgacttata ccactggtct gtccggtcgt attcagactt ctgggctgag 
301 ttctggaagt tcagtggaat tgtctgctct cgcatgtatg atgaggttgt ggacacatcc 
361 aaaggaattg cagatgtccc tgagtggttc agaggcagcc gcctcaacta tgcagagaac 
421 cttctgcggc acaaggagaa cgacagagtc gccctttacg tggcccggga aggcagagag 
481 gagattgcga aggtgacttt cgaagagctt cggcagcagg tggctctgtt tgcagccgcc 
541 atgaggaaga tgggcgtgaa gaaaggggac cgtgtggtcg gttatctccc caacagtgcc 
601 catgccgtgg aggccatgct ggctgctgcc agtattggag ccatttggag ttctacctca 
661 ccagactttg gtgtgaatgg tgtcctggac cgcttttctc aaattcagcc gaaacttatc 
721 ttctcggtgg aagctgtggt ctacaacggc aaggaacacg gccacctgga gaagctgcag 
781 cgagtcgtga aaggacttcc tgaccttcag cgagtggtgc tgatccccta tgtcctccca 
841 agggagaaga tagacatttc caagatcccc aacagcatgt ttctggatga cttcctggca 
901 agcgggacag gtgcgcaggc accacagctc gagtttgaac agctgccctt cagccatccc 
961 ctgttcatca tgttctcctc gggcacgaca ggagcgccca agtgcatggt gcactctgct 
1021 gggggcaccc tcatccagca cctgaaggag cacgtgctac atggcaacat gacaagcagt 
1081 gacatcctgc tctactacac cacggtcggc tggatgatgt ggaactggat ggtgtcagcg 
1141 ctggccacag gagcatcctt ggttctgtac gatggctccc cgctggttcc aacacccaat 
1201 gtgttgtggg accttgtgga caggatagga atcaccatcc tgggaacggg agccaagtgg 
1261 ctgtcagtgc tggaggagaa ggacatgaag ccgatggaaa ctcacaacct ccacacgctc 
1321 cacacgatcc tgtccaccgg ctcgccactg aaagcccaga gctatgagta tgtgtacaga 
1381 tgcatcaaga gcaccgtgct cctcggctcc atctcaggtg gcactgacat catctcctgt 
1441 ttcatgggcc agaactcatc tattcctgtg tacaagggtg agatccaagc ccggaacctc 
1501 ggcatggccg tggaagcctg ggacgaggaa gggaaaaccg tctggggagc gagtggcgag 
1561 ctggtttgca ccaagcccat accctgccag cccacgcact tctggaacga cgagaacggc 
1621 agcaagtaca ggaaggctta cttctccaaa tacccaggtg tctgggcaca cggcgactac 
1681 tgcaggatca accccaagac aggaggtatc gtcatgttgg gccggagtga tggcaccctc 
1741 aaccccaatg gcgtacgctt tggcagctcg gagatctaca acattgtgga agccttcgat 
1801 gaggtggagg acagcctttg tgtgccccag tacaacaggg atggtgagga gcgggtagtc 
1861 ctgtttctga agatggcctc tgggcacact ttccagcccg acctcgtgaa gcacatccgt 
1921 gatgccatcc gccttggcct gtctgctcgc cacgtgccca gcctcatcct ggagacccaa 
1981 ggcattccat acacaatcaa cggcaagaaa gtggaggtgg ccgtgaagca ggtgatagct 
2041 gggaagactg tggagcaccg gggggccttc tccaaccctg agtccctgga cctgtatcgg 
2101 gacatccctg agctgcagga cttctgaacc aatggctcgc actcggtcca gctgtgcacg 
2161 tgatggaact tagggacagc agctgctccg gtggcccagg cactacgcac tccacgggct 
2221 cagagacagc ttccctgttt cggagtcatg ggtggggacc aggtctgtct cctgagctcc 
2281 aggatgtgcc tggccttcgg agaccaaaag ggtacactga ggcctgtctt ggccctgctg 
2341 ctataggtca gcacacagcc ttgcaggcgc agggctggtg acatttggtg acaacacact 
2401 ggaggagcga tgtggccttc agcctcatga gcccacagtt ccacctcgct ctatgaaggt 
2461 gaaatcttac ctgtattctc tgcccttaat aactggtctg gcttcctcag gtgtactgtc 
2521 ttctcgagag ctgcctgaag ccagggcacc tggtccagca ggcctgtcac ctcacacttg 
2581 gctgttggga tgcctgggct gatggctctg tctgccaagg ccagaacacc attctaggga 
2641 actgcgagtg aatcctccag acagccaaga gctctctgct gtggcaagtc ccctctgctg 
2701 gcgagcgtcc ccgactgttt ctgggtctta acagcacaag tatttgaatg ggtcaaacgt 
2761 caatgttact ggaggatctg ccgtctggag gcacttgaat cccctgggtg tctaccctag 
2821 gttaccactt agctgtgtgc tgggcggtgt gatcctggag tcttcccggg atggagctgc 
2881 ccggcctcag aggaaaatgg ccgcgtgctc cctcctaata gttccgggct ctttctcttg 
2941 tgactgtact ccatgtttgt aaatgacgtc tcaggctttg gctgccagga ctcaatggtg 
3001 cttcttgtct agcacacaga atatgtcata aatcctggtg attatacttt tgtatttctt 
3061 tctttctttt ttttttttta aaactgagca caacataaaa ccttttaaaa atatcttgga 
3121 ttttaactct ataagggaaa agtgctatga agaattttat ggaataaacc tgtgccatgc 
3181 acgcctatcc 

SEQ ID NO: 108 

Amino acid sequence of rat FLJ12389 encoded by the DNA sequence shown in SEQ ID NO: 
107. 



MSKLARLEREEIMECQVMWEPDSKKDTQMDRFRAAVGTACGLALGNYDDLYHWSVRSYSD 
FWAEFWKFSGIVCSRMYDEVVDTSKGIADVPEWFRGSRLNYAENLLRHKENDRVALYV 

EGREEIAKVTFEELRQQVALFAAAMRKMGVKKGDRVVGYLPNSAHAVEAMLAAASIGAIW 
SSTSPDFGVNGVLDRFSQIQPKLIFSVEAVVYNGKEHGHLEKLQRVVKGLPDLQRVVLIP 
YVLPREKIDISKIPNSMFLDDFLASGTGAQAPQLEFEQLPFSHPLFIMFSSGTTGAPKCM 
VHSAGGTLIQHLKEHVLHGNMTSSDILLYYTTVGWMMWNWMVSALATGASLVLYDGSPLV 
PTPNVLWDLVDRIGITILGTGAKWLSVLEEKDMKPMETHNLHTLHTILSTGSPLKAQSYE 

YVYRCIKSTVLLGSISGGTDIISCFMGQNSSIPVYKGEIQARNLGMAVEAWDEEGKTVWG 
ASGELVCTKPIPCQPTHFWNDENGSKYRKAYFSKYPGVWAHGDYCRINPKTGGIVMLGRS 
DGTLNPNGVRFGSSEIYNIVEAFDEVEDSLCVPQYNR 
KHIRDAIRLGLSARHVPSLILETQGIPYTINGKKVEVAVKQVIAGKTVE 
DLYRDIPELQDF 



SEQ ID NO: 109 

gi|22547160|ref|NM_012193.2| Homo sapiens frizzled homolog 4 (Drosophila) (FZD4), 
mRNA 

1 gctgcgcagc gctggctgct ggctggcctc gcggagacgc cgaacggacg cggccggcgc 
61 cggcttgtgg gctcgccgcc tgcagccatg accctcgcag cctgtccctc ggcctcggcc 
121 cgggacgtct aaaatcccac acagtcgcgc gcagctgctg gagagccggc cgctgccccc 
181 tcgtcgccgc atcacactcc cgtcccggga gctgggagca gcgcgggcag ccggcgcccc 
241 cgtgcaaact gggggtgtct gccagagcag ccccagccgc tgccgctgct acccccgatg 
301 ctggccatgg cctggcgggg cgcagggccg agcgtcccgg gggcgcccgg gggcgtcggt 
361 ctcagtctgg ggttgctcct gcagttgctg ctgctcctgg ggccggcgcg gggcttcggg 
421 gacgaggaag agcggcgctg cgaccccatc cgcatctcca tgtgccagaa cctcggctac 
481 aacgtgacca agatgcccaa cctggttggg cacgagctgc agacggacgc cgagctgcag 
541 ctgacaactt tcacaccgct catccagtac ggctgctcca gccagctgca gttcttcctt 
601 tgttctgttt atgtgccaat gtgcacagag aagatcaaca tccccattgg cccatgcggc 
661 ggcatgtgtc tttcagtcaa gagacgctgt gaacccgtcc tgaaggaatt tggatttgcc 
721 tggccagaga gtctgaactg cagcaaattc ccaccacaga acgaccacaa ccacatgtgc 
781 atggaagggc caggtgatga agaggtgccc ttacctcaca aaacccccat ccagcctggg 
841 gaagagtgtc actctgtggg aaccaattct gatcagtaca tctgggtgaa aaggagcctg 
901 aactgtgtgc tcaagtgtgg ctatgatgct ggcttataca gccgctcagc caaggagttc 
961 actgatatct ggatggctgt gtgggccagc ctgtgtttca tctccactgc cttcacagta 
1021 ctgaccttcc tgatcgattc ttctaggttt tcctaccctg agcgccccat catatttctc
1081 agtatgtgct ataatattta tagcattgct tatattgtca ggctgactgt aggccgggaa 
1141 aggatatcct gtgattttga agaggcagca gaacctgttc tcatccaaga aggacttaag 
1201 aacacaggat gtgcaataat tttcttgctg atgtactttt ttggaatggc cagctccatt
1261 tggtgggtta ttctgacact cacttggttt ttggcagcag gactcaaatg gggtcatgaa 
1321 gccattgaaa tgcacagctc ttatttccac attgcagcct gggccatccc cgcagtgaaa
1381 accattgtca tcttgattat gagactggtg gatgcagatg aactgactgg cttgtgctat 
1441 gttggaaacc aaaatctcga tgccctcacc gggttcgtgg tggctcccct ctttacttat 
1501 ttggtcattg gaactttgtt cattgctgca ggtttggtgg ccttgttcaa aattcggtca 
1561 aatcttcaaa aggatgggac aaagacagac aagttagaaa gactgatggt caagattggg 
1621 gtgttctcag tactgtacac agttcctgca acgtgtgtga ttgcctgtta tttttatgaa
1681 atctccaact gggcactttt tcggtattct gcagatgatt ccaacatggc tgttgaaatg 
1741 ttgaaaattt ttatgtcttt gttggtgggc atcacttcag gcatgtggat ttggtctgcc 
1801 aaaactcttc acacgtggca gaagtgttcc aacagattgg tgaattctgg aaaggtaaag
1861 agagagaaga gaggaaatgg ttgggtgaag cctggaaaag gcagtgagac tgtggtataa 
1921 ggctagtcag cctccatgct ttcttcattt tgaagggggg aatgccagca ttttggagga 
1981 aattctacta aaagttttat gcagtgaatc tcagtttgaa caaactagca acaattaagt 
2041 gacccccgtc aacccactgc ctcccacccc gaccccagca tcaaaaaacc aatgattttg
2101 ctgcagactt tggaatgatc caaaatggaa aagccagtta gaggctttca aagctgtgaa 
2161 aaatcaaaac gttgatcact ttagcaggtt gcagcttgga gcgtggaggt cctgcctaga 
2221 ttccaggaag tccagggcga tactgttttc ccctgcaggg tgggatttga gctgtgagtt 
2281 ggtaactagc agggagaaat attaactttt ttaacccttt accattttaa atactaactg 
2341 ggtctttcag atagcaaagc aatctataaa cactggaaac gctgggttca gaaaagtgtt
2401 acaagagttt tatagtttgg ctgatgtaac ataaacatct tctgtggtgc gctgtctgct 
2461 gtttagaact ttgtggactg cactcccaag aagtggtgtt agaatctttc agtgcctttg 
2521 tcataaaaca gttatttgaa caaacaaaag tactgtactc acacacataa ggtatccagt 
2581 ggatttttct tctctgtctt cctctcttaa atttcaacat ctctcttctt ggctgctgct 
2641 gttttcttca ttttatgtta atgactcaaa aaaggtattt ttatagaatt tttgtactgc
2701 agcatgctta aagaggggaa aaggaagggt gattcacttt ctgacaatca cttaattcag 
2761 aggaaaatga gatttactaa gttgacttac ctgacggacc ccagagacct attgcattga 
2821 gcagtgggga cttaatatat tttacttgtg tgattgcatc tatgcagacg ccagtctgga 
2881 agagctgaaa tgttaagttt cttggcaact ttgcattcac acagattagc tgtgtaattt 
2941 ttgtgtgtca attacaatta aaagcacatt gttggaccat gacatagtat actcaactga
3001 ctttaaaact atggtcaact tcaacttgca ttctcagaat gatagtgcct ttaaaaattt
3061 ttttattttt taaagcataa gaatgttatc agaatctggt ctacttagga caatggagac 
3121 tttttcagtt ttataaaggg aactgaggac agctaatcca actacttggt gcgtaattgt 
3181 ttcctagtaa ttggcaaagg ctccttgtaa gatttcactg gaggcagtgt ggcctggagt 
3241 atttatatgg tgcttaatga atctccagaa tgccagccag aagcctgatt ggttagtagg
3301 gaataaagtg tagaccatat gaaatgaact gcaaactcta atagcccagg tcttaattgc 
3361 ctttagcaga ggtatccaaa gcttttaaaa tttatgcata cgttcttcac aagggggtac 
3421 ccccagcagc ctctcgaaaa ttgcacttct cttaaaactg taactggcct ttctcttacc 
3481 ttgccttagg ccttctaatc atgagatctt ggggacaaat tgactatgtc acaggttgct 
3541 ctccttgtaa ctcatacctg tctgcttcag caactgcttt gcaatgacat ttatttatta
3601 attcatgcct taaaaaaata ggaagggaag cttttttttt tctttttttt tttttcaatc 
3661 acactttgtg gaaaaacatt tccagggact caaaattcca aaaaggtggt caaattctgg 
3721 aagtaagcat ttcctctttt ttaaaaattt ggtttgagcc ttatgcccat agtttgacat 
3781 ttccctttct tctttccttt ttgtttttgt gtggttcttg agctctctga catcaagatg 
3841 catgtaaagt cgattgtatg ttttggaagg caaagtcttg gcttttgaga ctgaagttaa
3901 gtgggcacag gtggcccctg ctgctgtgcc cagtctgagt accttggcta gactctaggt 
3961 caggctccag gagcatgaga attgatcccc agaagaacca ttttaactcc atctgatact 
4021 ccattgccta tgaaatgtaa aatgtgaact ccctgtgctg cttgtagaca gttcccataa 
4081 ctgtccacgg ccctggagca cgcacccagg ggcagagcct gcccttactc acgctctgct 
4141 ctggtgtctt gggagttgtg cagggactct ggcccaggca ggggaaggaa gaccaggcgg
4201 taggggactg gtcttgctgt tagagtatag aggtttgtaa tgcagttttc ttcataatgt 
4261 gtcagtgatt gtgtgaccaa ggcagcatct agcagaaagc caggcatgga gtaggtgatc 
4321 gatacttgtc aatgactaaa taataacaat aaaagagcac ttgggtgaat ctgggcacct 
4381 gatttctgag ttttgagttc tggagctagt gttttgacaa tgctttgggt tttgacatgc 
4441 cttttccaca aatctcttgc cttttcaggg caaagtgtat ttgatcagaa gtggccattt
4501 ggattagtag ccttagcaat gctacagggt tataggcctc tcctttcaca ttccagacaa 
4561 tggagagtgt ttatggtttc aggaaaagaa ctttgtggct gaggggtcag ttaccagtga 
4621 ccttcaatca actccatcac ttcttaaatc ggtatttgtt aaaaaaatca gttattttat 
4681 ttattgagtg ccgactgtag taaagccctg aaatagataa tctctgttct tctaactgat
4741 ctaggatggg gacgcaccca ggtctgctga actttactgt tcctctggga aaggagcagg
4801 gacctctgga attcccatct gtttcactgt ctccattcca taaatctctt cctgtgtgag 
4861 ccaccacacc cagcctgggt ctctctactt ttaacacatc tctcatccct ttcccaggat 
4921 tccttccaag tcagttacag gtggttttaa cagaaagcat cagctctgct tcgtgacagt 
4981 ctctggagaa atcccttagg aagactatga gagtaggcca caaggacatg ggcccacaca 
5041 tctgctttgg ctttgccggc aattcagggc ttggggtatt ccatgtgact tgtataggta
5101 tatttgagga cagcatcttg ctagagaaaa ggtgagggtt gtttttcttt ctctgaaacc
5161 tacagtaaat gggtatgatt gtagcttcct cagaaatccc ttggcctcca gagattaaac 
5221 atggtgcaat ggcacctctg tccaacctcc tttctggtag attcctttct cctgcttcat 
5281 ataggccaaa cctcagggca agggaacatg ggggtagagt ggtgctggcc agaaccatct 
5341 gcttgagcta cttggttgat tcatatcctc tttcctttat ggagacccat ttcctgatct
5401 ctgagactgt tgctgaactg gcaacttact tgggcctgaa actggagaag gggtgacatt 
5461 tttttaattt cagagatgct ttctgatttt cctctcccag gtcactgtct cacctgcact

5521 ctccaaactc aggttccggg aagcttgtgt gtctagatac tgaattgaga ttctgttcag 

5581 caccttttag ctctatactc tctggctccc ctcatcctca tggtcactga attaaatgct 

5641 tattgtattg agaaccaaga tgggacctga ggacacaaag atgagctcaa cagtctcagc 

5701 cctagaggaa tagactcagg gatttcacca ggtcggtgca gtatttgatt tctggtgagg

5761 tgaccacagc tgcagttagg gaagggagcc attgagcaca gactttggaa ggaacctttt 

5821 ttttgttgtt tgtttgtttg tttgtttgtt tgtttgtttg agacagggtc ttgctctgtc 

5881 acccaggctg gggcgcaatg gcacgatctt ggctcactgc aacctctgcc tcctgggttc 

5941 aagtgattct cctgccacag cctcctgagg agctgggact acaggtgcgt gctaccacgc 

6001 ccagctactt ctgtattttt agtagagacg gggtttcact gtgttggcca ggctggtctc

6061 gaactcctga cctcatgatc tgcccgcctc agcctcccaa agtgctggga ttacaagtgt 

6121 gagccaccac acctggcctg gaaggaacct cttaaaatca gtttacgtct tgtattttgt 

6181 tctgtgatgg aggacactgg agagagttgc tattccagtc aatcatgtcg agtcactgga 

6241 ctctgaaaat cctattggtt cctttatttt atttgagttt agagttccct tctgggtttg 

6301 tattatgtct ggcaaatgac ctgggttatc acttttcctc cagggttaga tcatagatct

6361 tggaaactcc ttagagagca ttttgctcct accaaggatc agatactgga gccccacata 

6421 atagatttca tttcactcta gcctacatag agctttctgt tgctgtctct tgccatgcac 

6481 ttgtgcggtg attacacact tgacagtacc aggagacaaa tgacttacag atcccccgac 

6541 atgcctcttc cccttggcaa gctcagttgc cctgatagta gcatgtttct gtttctgatg 

6601 tacctttttt ctcttcttct ttgcatcagc caattcccag aatttcccca ggcaatttgt

6661 agaggacctt tttggggtcc tatatgagcc atgtcctcaa agcttttaaa cctccttgct 

6721 ctcctacaat attcagtaca tgaccactgt catcctagaa ggcttctgaa aagaggggca 

6781 agagccactc tgcgccacaa aggttgggtc catcttctct ccgaggttgt gaaagttttc 

6841 aaattgtact aataggctgg ggccctgact tggctgtggg ctttgggagg ggtaagctgc 

6901 tttctagatc tctcccagtg aggcatggag gtgtttctga attttgtcta cctcacaggg

6961 atgttgtgag gcttgaaaag gtcaaaaaat gatggcccct tgagctcttt gtaagaaagg 

7021 tagatgaaat atcggatgta atctgaaaaa aagataaaat gtgacttccc ctgctctgtg 

7081 cagcagtcgg gctggatgct ctgtggcctt tcttgggtcc tcatgccacc ccacagctcc 
7141 aggaaccttg aagccaatct gggggacttt cagatgtttg acaaagaggt accaggcaaa

7201 cttcctgcta cacatgccct gaatgaattg ctaaatttca aaggaaatgg accctgcttt

7261 taaggatgta caaaagtatg tctgcatcga tgtctgtact gtaaatttct aatttatcac 

7321 tgtacaaaga aaaccccttg ctatttaatt ttgtattaaa ggaaaataaa gttttgtttg 

7381 ttaaaaaaaa a 

SEQ ID NO: 110 

Amino acid sequence of human FZD4 encoded by the DNA sequence shown in SEQ ID NO:
109.

MAWRGAOPSVPGAPGGVQLSLGLLLQLLLLLQPAHGFGDEEBRRCDPIRISMCQNLOYNV 
TKMPNLVGHELQTDAELQLTTFTPLIQYGCSSQLQFFLCSVYVPMCTEKINIPIGPCGGM 
CLSVKimCEPVLKEFGFAWPESLNCSKFPPQNDHNHMCMEGPGDEEVPLPHKTPIQPGEE 

CHSVGTNSDQYIWVKRSLNCVLKCGYDAGLYSRSAKEFTDIWMAVWASLCFISTAFTVLT
FLIDSSRFSYPERPIIFLSMCYNIYSIAYIVRLTVGRERISCDFEEAAEPVLIQEGLKNT 
GCAX I FLLMYFFGMASSIWWVILTLTWFIiAAGLKWGHEAIEMHSSYFHIAAWAIPAVKTI 
VILIMRIiVDADELTGLCYVGNQNLDALTGFWAPLFTYLVIGTLFIAAGLVALFKIRSNL 
QKDGTKTDKLERLMVKIGVFSVLYTVPATCVIACYFYEISNWAIiFRYSADDSNMAVEMLK 

IFMSLLVGITSGMWIWSAKTLHTWQKCSNRLVNSGKVKREKRGNGWVKPGKGSETVV

SEQ ID NO: 111

gi|13548680|dbj|AB054881.1| Homo sapiens mRNA for Soluble-type polypeptide FZD4S,
complete cds 

1 atcccacaca gtcgcgcgca gctgctggag agccggccgc tgccccctcg tcgccgcatc 

61 acactcccgt cccgggagct gggagcagcg cgggcagccg gcgcccccgt gcaaactggg

121 ggtgtctgcc agagcagccc cagccgctgc cgctgctacc cccgatgctg gccatggcct 

181 ggcggggcgc agggccgagc gtcccggggg cgcccggggg cgtcggtctc agtctggggt 
241 tgctcctgca gttgctgctg ctcctggggc cggcgcgggg cttcggggac gaggaagagc
301 ggcgctgcga ccccatccgc atctccatgt gccagaacct cggctacaac gtgaccaaga
361 tgcccaacct ggttgggcac gagctgcaga cggacgccga gctgcagctg acaactttca 
421 caccgctcat ccagtacggc tgctccagcc agctgcaggt gggcgccccc acccccaccc 
481 ctggcgggac cccttgggga gggacgctcc aaactaactt tgcagagccc atgccaagtt
541 gatcatccta gcccgaagga gacagctctc ctggaaaagt gatttccatc cccaccccca 
601 ccttctcagg gactggaagc caaaacgttg tgtaagtcat ctggcctggg aaagaaccca 
661 ctcttacacc ccgcccttcc gtttttctct cttgcccacc catgtctggc ccagacccct 
721 aaccccagtg gagctgaggg ttatctctgc caaggattct ggccgccact gctcggtggg 
781 cgagtgggag ggagccagct gcagctaaga cttggaggga gtgataagga gaccatcctt
841 caccgaactt ccttctcccc actatttttg gggtgttatt aaatgaaaca ctactgcacc 
901 tttttctgct gaatagcccc ttctcacgac gtcccgcagc gttttagagg tcatgcatga 
961 aggagtggtt gggttggctt gagttctttc ttatcctcac agttgaaggc acgtttaatg 
1021 cttggagggt gagaagaagc tgcaggaagg tggttggtat attggaagaa atttttttgc 
1081 agtcattaaa aaatgtaaag catatctaat gatagagaaa atgtttattc cacagtaata
1141 agaatttgca tatacagggt gattataatc ctgcaaaata aaaaatttat tgggataata 
1201 aaagactgac aggaaaattt ttaaagtgtt aacattggtt atgtttgggt tggtgaggct 
1261 gggtgatttt tagaaattta gaaaagagag aagttgtggg aaaaagtaga cgttagttgt 
1321 ccaggcttca gaaatctggt ttctggccgg gcgcagtggc tcacgcctgt aatcccagca 
1381 ctttgggagg ccgaggcagg tggctcacga ggttaggagt tcgagaccag cctgaccaac
1441 atggcgaaat cctgtctcta ctaaaagtac aaaaattagc cgggcgtggt ggcgtgcgcc 
1501 tgtaatccca gctactcagg aggctgaggc aggagaatcg cttgaaccca ggaggcagag 
1561 gttgcagtga gcggggattg caccattgca ctctagcctg ggccacagga gcgaaactcc 
1621 gtatcaaaaa aaaaaagaaa agaaaagaaa tctgtggttc tggctgtaac ttctgagaga 
1681 taataaacag acatacagct tacttactag gtgcatgttt ttacataggt cattttaaaa
1741 cttggtttgt ggttttggaa agatggtggg gattagacct gttgctgcta aaatatccct 
1801 ttggaagttt cagaacttta gaactaggtt ataaaagctt aagaaatgat cctccagctc 
1861 tcagtttgga aacagcacat atcctgacat cagtggcaat ttgttggaga aacagctttt 
1921 gcagggatat atatttttaa ttacatgtat cctggggaaa gcggccaagc tgttttgaag 
1981 gacaagactg gatcccttta catgctggaa aatagttact tgtaactcta gacttcatag
2041 acaacatctg taaggagcca agcagactgt ttatcaatgc tggaggaatt gaggacagca 
2101 actggactct cccctgctgg tactgcagtt tcagcgactt aacatttaca ttttcctatt 
2161 gtgcttcatt attattcatt gaggaaacct ggccaaacaa atccctagtg accagagacc 
2221 tcaatctgca tcccatcctt gagtctaaca gtgtctcatt tgggccattt tttgtttcag 
2281 tatgggacta gtttaaacaa atttttaagg gataaattag ggagaagaga atactaacag
2341 gatctacatt tttttttaac ttttattttt aaaatgataa agaaacgggg ataaaaataa 
2401 accacctaaa tgagtcatat tagtttcttt agctttttca aatagtttct aaaatggaag 
2461 gagattgaaa gccagttttt taagatttat ttcaagtgtt gtgcaaagag ggcttatgca 
2521 attcttcaga ccctatttat tttatttatt tatttatgta cattcttctg gaaaggatta 
2581 ctaaagccaa actaagaaaa gtatgtaagt gatggattat caggcgccct tgtttgaggt
2641 cagagtaaac agacagactc ccgataggtt ggtttaaaag tagataccta gattgcctgg 
2701 aagcattcaa ctcagctttg tgggagcatt tggtcaaact tccaagtcat tgtttttttt 
2761 gtgtgtgttc attttgttac ttttccagtt cttcctttgt tctgtttatg tgccaatgtg 
2821 cacagagaag atcaacatcc ccattggccc atgcggcggc atgtgtcttt cagtcaagag 
2881 acgctgtgaa cccgtcctga aggaatttgg atttgcctgg ccagagagtc tgaactgcag
2941 caaattccca ccacagaacg accacaacca catgtgcatg gaagggccag gtgatgaaga 
3001 ggtgccctta cctcacaaaa cccccatcca gcctggggaa gagtgtcact ctgtgggaac 
3061 caattctgat cagtacatct gggtgaaaag gagcctgaac tgtgtgctca agtgtggcta 
3121 tgatgctggc ttatacagcc gctcagccaa ggagttcact gatatctgga tggctgtgtg 
3181 ggccagcctg tgtttcatct ccactgcctt cacagtactg accttcctga tcgattcttc
3241 taggttttcc taccctgagc gccccatcat atttctcagt atgtgctata atatttatag 
3301 cattgcttat attgtcaggc tgactgtagg ccgggaaagg atatcctgtg attttgaaga 
3361 ggcagcagaa cctgttctca tccaagaagg acttaagaac acaggatgtg caataatttt 
3421 cttgctgatg tacttttttg gaatggccag ctccatttgg tgggttattc tgacactcac 
3481 ttggtttttg gcagcaggac tcaaatgggg tcatgaagcc attgaaatgc acagctctta
3541 tttccacatt gcagcctggg ccatccccgc agtgaaaacc attgtcatct tgattatgag
3601 actggtggat gcagatgaac tgactggctt gtgctatgtt ggaaaccaaa atctcgatgc 
3661 cctcaccggg ttcgtggtgg ctcccctctt tacttatttg gtcattggaa ctttgttcat 
3721 tgctgcaggt ttggtggcct tgttcaaaat tcggtcaaat cttcaaaagg atgggacaaa 
3781 gacagacaag ttagaaagac tgatggtcaa gattggggtg ttctcagtac tgtacacagt
3841 tcctgcaacg tgtgtgattg cctgttattt ttatgaaatc tccaactggg cactttttcg 
3901 gtattctgca gatgattcca acatggctgt tgaaatgttg aaaattttta tgtctttgtt
3961 ggtgggcatc acttcaggca tgtggatttg gtctgccaaa actcttcaca cgtggcagaa 
4021 gtgttccaac agattggtga attctggaaa ggtaaagaga gagaagagag gaaatggttg 
4081 ggtgaagcct ggaaaaggca gtgagactgt ggtataaggc tagtcagcct ccatgctttc 
4141 ttcattttga aggggggaat gccagcattt tggaggaaat tctactaaaa gttttatgca
4201 gtgaatctca gtttgaacaa actagcaaca attaagtgac ccccgtcaac ccactgcctc 
4261 ccaccccgac cccagcatca aaaaaccaat gattttgctg cagactttgg aatgatccaa 
4321 aatggaaaag ccagttagag gctttcaaag ctgtgaaaaa tcaaaacgtt gatcacttta
4381 gcaggttgca gcttggagcg tggaggtcct gcctagattc caggaagtcc agggcgatac 
4441 tgttttcccc tgcagggtgg gatttgagct gtga

SEQ ID NO: 112

Amino acid sequence of human FZD4 variant ORF number 1 encoded by the DNA sequence 
shown in SEQ ID NO: 111. 

MLA>IAWRGAGPSVPGAPGGVGLSLGLLLQLLLLLGPARQFQDEEERRCDPIRISMCQNLG 
YNVTKMPNLVGHELQTDAELQLTTPTPLIQyGCSSQLQVGAPTPTPGGTPWGGTLQTNFA
EPMPS 

SEQ ID NO: 113 

gi|15929644|gb|BC015256.1| Mus musculus frizzled homolog 4 (Drosophila), mRNA (cDNA
clone MGC:18403 IMAGE:4238940), complete cds

1 agcgctgggg cggtgagaac agcgcggcgt agagtgcagg cgggcttcgc cgaaaagccg

61 gactcggccg gcgccgagtt ctgggatcgc cgcctgcagc catgacccta gcagtccatc 
121 cctcggcccg ggctccggac gtctgatatc ccgcacattc tcgtacaact gctggagagg 
181 cgactgctgc ccccttgtcg cccttggcgc cttaccgcat tccctatccg gagttgggag 
241 cagcgcggcc accggcgccc ctgtgcaaac tgggggtgtc tgctagatca gcctctgccg 
301 ctgctgcccg cagctctggc catggcctgg ccgggcacag ggccgagcag ccggggggcg
361 cctggaggcg tcgggctcag gctggggctg ctgctgcagt tgctcctgct cctgcggccg 
421 acattggggt tcggggacga ggaggagcgg cgctgcgacc ccatccgcat cgccatgtgc 
481 cagaacctcg gctacaacgt gaccaagatg cccaacttag tgggacacga gctgcagaca 
541 gacgccgagc tgcagctgac aactttcacg ccgctcatcc agtacggctg ctccagccag 
601 ctgcagttct tcctttgttc ggtttatgtg ccaatgtgca cagagaagat caacatcccc
661 atcggcccgt gcggtggcat gtgcctttca gtcaagagac gctgtgaacc agtcctgaga 
721 gaatttgggt ttgcctggcc cgacaccctg aactgcagca agttcccgcc ccagaacgac 
781 cacaaccaca tgtgcatgga aggaccaggt gatgaagagg ttcccttgcc ccacaagact 
841 cccatccagc ccggggaaga gtgccactcc gtgggaagca attctgatca gtacatctgg 
901 gtgaagagga gcctgaactg tgttctcaag tgtggctacg atgctggctt gtacagccgc
961 tcagctaagg agttcacgga tatttggatg gctgtgtggg ccagcctctg cttcatctcc 
1021 accaccttca ccgtgctgac cttcctgatt gattcatcca ggttttctta ccctgagcgc 
1081 cccatcatat ttctcagtat gtgctataat atttatagca ttgcttatat tgttcggctg 
1141 actgtaggcc gggaaaggat atcctgtgat tttgaagagg cggcagagcc cgttctcatc 
1201 caagaaggac ttaagaacac aggatgtgca ataattttct tgctgatgta cttttttgga
1261 atggccagct ccatttggtg ggttattctg acactcactt ggtttttggc agccggactc 
1321 aagtggggtc atgaagccat tgaaatgcac agttcttatt tccacatcgc agcctgggct 
1381 attcccgcag tgaaaaccat tgtcatcttg attatgagac tagtggatgc cgatgaactg 
1441 actggcttgt gctatgttgg gaaccaaaac ctagatgccc tcactggctt tgtggtggct 
1501 cctctcttta cgtatttggt gattggaacg ctgttcattg cggcgggttt ggtggcctta
1561 ttcaaaattc ggtccaatct tcaaaaagac gggacaaaga cagacaagtt ggaaaggcta 
1621 atggtcaaga tcggggtctt ctcagtactg tacacggttc ctgcaacctg tgtgattgcc 
1681 tgttatttct atgaaatctc aaactgggca ctctttcgat attctgcaga tgactcaaac 
1741 atggcagttg aaatgttgaa aatttttatg tctttgctcg tgggcatcac ttcaggcatg 
1801 tggatttggt ctgccaaaac tcttcacacg tggcaaaagt gttctaaccg attggtgaat
1861 tctgggaagg taaagagaga gaagaggggg aatggttggg tgaagccagg aaaaggcaac 
1921 gagactgtgg tataagacta gccggcttcc tcgttcctca ttgtgaagga agtgatgcag 
1981 ggaatctcag tttgaacaaa cttagaaaca cttcagccca cacacaccca cgtcagccca
2041 ccaccactca cccaactcag catcagaaga ccaatggctt cactgcagac tttggaatgg
2101 tccaaaatgg aaaagccagt tagaggtttt caaagctgtg aaaaatcaaa atgttgatca
2161 ctttagcagg tcacagcttg gagtccgtgg aggtcccgcc tagattcctg aagcccaggg
2221 tgatagtgtt tgctcctact gggtgggatt tcaactgtga gttgataaca tgcaaggaga
2281 aagattaatt tttaaaaccc ttttaaattt taaatagtaa ctaggtcttg cagatagcaa
2341 agtgatctat aaacactgga aatgctgggt tgggagacgt gttgcagagt tttatagttt
2401 ggctggtcta acataaacat cttctggcct acactgtctg ctgtttagaa ctctgtagcg
2461 cactcccaag aggtggtgtc aaaatccttc agtgcctttg tcgtaaaaca gaattgtttg
2521 agcaaacaaa agtactgtac taacacacgt aaggtatcca gtggatttct ctctcctgaa
2581 atttcaacat ccctaattct aggcagcccc tgttttcttc actttaaact aatgactcaa
2641 aaaaaaaaag gttattttta taggattttt ttttgcactg cagcatgcct aatgagagga
2701 aaagggaagg tgattcactt tctgacaatc acttaattca gagaaaaatg agatttgcta
2761 agttgactta ccttaccgac cctagagacc tattgcatta agcaatgtta agcaattggg
2821 acttaaaata ttttagtttg tgtgattgca tctaggcaga cgccagtctg gaagaactga
2881 aatgttaaat ttcttggcaa ctttgcattc acacagatta actgtgtaat ttgtgtgtgt
2941 caattacaat taaaagcaca ttcttggacc atgaaaaaaa aaaaaaaaa

SEQ ID NO: 114

Amino acid sequence of mouse FZD4 encoded by the DNA sequence shown in SEQ ID NO:
113.

MAWPGTGPSSROAPGGVGLRLOLLLQLLLLLRPTLGFGDEEERRCDPIRIAMCQNU3YNV
TKMPNLVGHEL<^ABI/5LTTFTPLIQYGCSSQL0PPLCSVYVPMCTBKINIPIGPCGGM
CLSVKRRCEPVLREFGFAWPDTLNCSKPPPQNDHNHMCMEGPGDEEVPLPHKTPIQPGEE
CHSVGSNSDQYIWKRSLNC^LKCGyDAGLYSRSAKEFTDIWMAVWASLCFISTTFTVLT
FLIDSSRFSYPERPIIFLSMCYNIYSIAYIVRLTVGRERISCDFEEAAEPVLIQEGLKNT
GCA 1 1 FLLMY FFGMASS I WWVI LTLTWFLAAGLKWGHEAI EMHS S YFH I AAWAX PAVKT I
V I L IMRLVDADELTGLC YVGNQNLDALTGFWAPL FTYLV IGTLF I AAGLVALFKI RSNL
QKDGTKTDKLERLMVKIGVFS VLYTVPATC VI AC YF YE I SNWALFRYS ADDSNMAVEMLK
I FMSLLVGITSGMW I WSAKTLHTWQKCSNRLWSGKVKREKRGNGWVKPGKGNETVV

SEQ ID NO: 115

gi|12018309|ref|NM_022623.1| Rattus norvegicus frizzled homolog 4 (Drosophila) (Fzd4),
mRNA



1 gagcttgtta actagagcag gctcattcat taaatcgtgc acatgattct acagcagctg 
61 aagaagcggc cgctgccccc tcgtcgccct tggcgcccta ccgcattccc tatccggagc 

121 tgggagctgc gcggccaccg gcgcccctgt gcaaactggg ggtgtctgct agagcagcct
181 ccgccgctgc tgcccgcggc tctggccatg gcttggcagg gcacagggcc aagcgtccgg 
241 gggatgcctg gaggcgtcag gctcaggctg gggctgctgc tgctgcagtt gctcctgctc 
301 cagcggcccg ccctggggtt cggggacgag gaggagcggc gttgcgaccc catccgcatc 
361 gccatgtgcc agaacctcgg ctacaacgtg accaagatgc ccaacttagt gggacacgag 

421 ctgcagacag acgcggagct gcagctgaca actttcacgc cgctcatcca gtacggctgc
481 tccagccagc tgcagttctt cctttgttcg gtttatgtgc caatgtgcac agagaagatc 
541 aacatcccca tcggcccgtg cggtggcatg tgtctttcag tcaagagacg atgtgaacca 
601 gtcctgaaag aatttgggtt tgcctggccg gacagcctga actgcagcaa gttcccaccc 
661 cagaacgacc acaaccacat gtgcatggaa ggaccaggtg acgaagaggt acccttgccc 

721 cacaaaactc ccatccagcc gggggaagag tgccactccg tgggaaccaa ttccgatcag
781 tacatctggg tgaaaaggag cctgaactgt gttctcaagt gtggctacga tgctggcttg 
841 tacagccgct cagctaagga gttcacggat atttggatgg ccgtgtgggc cagcctctgc 
901 ttcatctcta ccaccttcac tgtgctgacc ttcctgattg attcgtccag gttttcttac 
961 cctgagcgcc ccatcatatt cctcagtatg tgctataata tttatagcat tgcttatatt

1021 gttcggctta ctgtaggccg ggaaaggata tcctgtgatt ttgaagaggc ggcagaaccc
1081 gttctcatcc aagagggact taagaacaca ggatgtgcaa taattttctt gctgatgtac 
1141 ttttttggaa tggccagctc catttggtgg gttattctga cactcacttg gtttttggca 
1201 gccggactca agtggggtca cgaagccatt gaaatgcaca gttcttattt ccacatcgca
1261 gcctgggcta tccctgccgt gaaaaccatt gtcatcttga ttatgagact agtggatgcc 
1321 gatgagctga ctggcctgtg ctatgtcggg aaccaaagcc tagatgccct caccggcttt 
1381 gtggtggcac ctctctttac ctatttggtg attggaactc tattcattgc tgcaggcttg 
1441 gtggccctct tcaaaattcg gtcaaatctt caaaaagatg ggaccaagac agacaagttg
1501 gaaaggctaa tggtcaagat cggggtcttc tcggtcctgt acacggttcc tgcgacctgt 
1561 gtgattgcct gttatttcta tgaaatctca aactgggcac tctttcggta ttctgcagat 
1621 gactcaaaca tggcagttga aatgttgaaa atttttatgt ctttgctcgt gggcatcact 
1681 tcaggcatgt ggatttggtc tgccaaaact cttcacacgt ggcaaaaatg ttctaaccga 
1741 ttggtgaatt ctgggaaggt aaagagagag aagaggggga atggttgggt gaagccaggg
1801 aaaggcaacg aaactgtggt gtaggactag ctagcttcca ctttcctcat tttgaaggaa 
1861 aggatgcagt gaatctcagt ttgaacaaac tagaaaca 

SEQ ID NO: 116

Amino acid sequence of rat FZD4 encoded by the DNA sequence shown in SEQ ID NO: 115. 

MAWQGTGPSVRGMPGQVRLRLGLLLLQLLLliQRPALGFGDBEERRCDPIRIAMCQNLGYN
VTKMPNLVGHELQTDAELQLTTFTPLIQYGCSSQIiQFFLCSVYVPMCTBKINIPIGPCGG 
MCLSVKRRCEPVLKEFGFAWPDSLNCSKFPPQNDHNHMCMEGPGDEEVPLPHKTPIQPGE 
ECHSVGTNSDQYIWVKRSLNCVLKCGYDAGLYSRSAKEFTDIWMAVWASLCFISTTFTVL 
TFL IDSSRFS YPERP 1 1 FLSMCYNI YS I AYI VRLTVGRER ISCDFBEAAEPVLIQEGLKN 

TGCAIIFLLMYFFGMASSIWWVILTLTWFLAAGLKWGHEAIEMHSSYFHIAAWAIPAVKT
I VI L I MRLVDADELTGLCYVGNQS LDALTGFWAPLFTYL VI GTLF I AAGLVALFKIRSN 
LQKDGTKTDKLERLMVKIGVFSVLYTVPATCVIACYFYEISNWALFRYSADDSNMAVEML 
KI FMS LLVG I TSGMW I WS AKTLHT WQKCSMRLVNSGKVKREKRGNGVA^KPGKGNET VV 

SEQ ID NO: 117 

gi|5803150|ref|NM_006851.1| Homo sapiens GLI pathogenesis-related 1 (glioma) (GLIPR1),
mRNA 

1 ctctgttttc tcaaagctga agtcggctag gtttgcaaag ctgtgggctg agcactcagg 
61 caatcacact ctcagaaact gcggcggctc tggactgcag cctcccaagg ctccatgcca 
121 gacaaagcat gcgtgtcaca cttgctacaa tagcctggat ggtttctttt gtctccaatt 

181 attcacacac agcaaatatt ttgccagata tcgaaaatga agatttcatc aaagactgcg
241 ttcgaatcca taacaagttc cgatcagagg tgaaaccaac agccagtgat atgctataca 
301 tgacttggga cccagcacta gcccaaattg caaaagcatg ggccagcaat tgccagtttt 
361 cacataatac acggctgaag ccaccccaca agctgcaccc aaacttcact tcactgggag 
421 agaacatctg gactgggtct gtgcccattt tttctgtgtc ttccgccatc acaaactggt 

481 atgacgaaat ccaggactat gacttcaaga ctcggatatg caaaaaagtc tgtggccact
541 acactcaggt tgtttgggca gatagttaca aagttggctg cgcagttcaa ttttgcccta 
601 aagtttctgg ctttgacgct ctttccaatg gagcacattt tatatgcaac tacggaccag 
661 gagggaatta cccaacttgg ccatataaga gaggagccac ctgcagtgcc tgccccaata 
721 atgacaagtg tttggacaat ctctgtgtta accgacagcg agaccaagtg aaacgttact 

781 actctgttgt atatccaggc tggcccatat atccacgtaa cagatacact tctctctttc
841 tcattgttaa ttcagtaatt ctaatactgt ctgttataat taccattttg gtacagctca 
901 agtaccctaa tttagttctt ttggactaat acaattcagg aaagaaaaaa cccaaaaacc 
961 aacctcattc acatatggct tttttttaac caataacaat taggtgtact tctattttaa 
1021 aacatttcag aaaaaaatat atgttatagc aatactctta c 

SEQ ID NO: 118

Amino acid sequence of human GLIPR1 encoded by the DNA sequence shown in SEQ ID 
NO: 117. 

MRVTIATIAWMVSEVSNYSHTAN1LPDIENEDFIKDCVRIHNKFRSEVKPTA8DMLYMTW 
DPAIAQIAKAWASNCQFSHHTRIjKPPHKLHPNFT

IQDYDFKTRICKKVCGHYTQVWADSYKVGCAVQFCPKVSOFDALSNGAHFICNYGPGGN 
YPTWPYKRGATCSACPNNDKCLDNLCVNRQRDQVKRYYSVVYPGWPiyPRNRYTSLFLIV 
NSVILILSVIITILVQLKYPNLVLLD 

SEQ ID NO: 119

gi|21312071|ref|NM_028608.1| Mus musculus GLI pathogenesis-related 1 (glioma) (Glipr1),
mRNA 

1 gagcatgctg aagatggagc tcagaggcag agcacttgtc tagcataaac aaccctgggt 
61 taatccgagc tccaacaggg aaacagtctg cagactgaga gaaccgagca ttctatcaga 
121 accccgcagc tctggattct aggtccagca gcaaccagag agaccatgca ggtcatcctt 
181 gctgtgatag tctggatggc ttcgtctgtg tctagttctt catttacagc aagcactttg 
241 ccagatataa caaacgagga cttcattaaa gaatgtgttc aagttcacaa ccagcttcgg 
301 tcaaaagtga gtccaccagc ccggaatatg ctgtacatgt cttgggaccc aaaactagcc 
361 caaattgcaa aagcatggac aaaatcttgt gaatttaaac acaacccaca gctgcattca 
421 cggatacacc caaatttcac cgccctggga gagaatatct ggcttggctc tctatccatc 
481 ttttcagtat cctcagccat ctctgcctgg tatgaagaaa ttaagcacta tgacttcagc 
541 actaggaaat gtagacatgt ctgtggccat tatactcagg ttgtttgggc agacagttac 
601 aaacttggct gtgcagtgca actttgccct aatggagcaa attttatatg cgactatgga 
661 ccagcaggaa attacccaac gtggccatat aagcaaggag ccacgtgcag tgattgccca 
721 aaagatgaca agtgtctcaa cagtctctgc attaacccac gacgagacca ggtctcacgt 
781 tactactctg tcgattatcc agactggcct atatacctgc gtaacagata cacatctctc 
841 tttctcattg ctaagtcggt tctcctatta ctgtctgtta taattaccat ctgggtaaag 
901 cacaaatatc ctaacttggt tcttttggac taaagctgtg gttgggggac aactgaatca 
961 catgcggcta tttaaaaact tttcaataaa atctcagtca aaagg 

SEQ ID NO: 120

Amino acid sequence of mouse GLIPR1 encoded by the DNA sequence shown in SEQ ID 
NO: 119. 

MQVIIAVIW^SSVSSSSFTASTLPDITNEDFIKECVQVHNQLRSKVSPPAIUWLYMSW 
DPKLAQIAKAWTKSCEFKHNPQLHSRIHPNFTALGENIWLGSLSIFSVSSAISAWYEEIK 
HYDFSTRKCRHVCGHYTQWWADSYKLGCAVOLCPNGANFICDYGPAGNYPTWPYKQGAT 
CSDCPKDDKCLNSLCINPRRDQVSRYYSVDYPDWPIYLRNRYTSLFLIAKSVLLLLSVII 
TIWVKHKYPNLVLLD 

SEQ ID NO: 121

gi|27718248|ref|XM_216892.1| Rattus norvegicus similar to GLI pathogenesis-related 1
(glioma); related to testes-specific, vespid, and pathogenesis proteins (LOC299783), mRNA 

1 gtgaactgaa gatggggctc aggagcagag cacttgtcta gcataaacaa ccctgggttc 
61 attcaagctc caacatggaa actgtctgca gactgagcga accaagcatt ctatcagaac 
121 cctgaggccc tggattctag gtccagcagc agccagaggg accatgcagg tcctcctcgc 
181 tgtgatggtc tggatggctt cttctgcgtc tggtttttca tatacagcaa gtactttgcc 
241 aaaaataaca aacgaggact tcatcgaaga atgcgttgaa gttcacaacc actttcggtc
301 aaaagcgtat ccaccggccg ggaatatgtt gtacatgtct tgggacccaa aactagccca 
361 aattgcaaaa gcgtgggcac agtcttgtgt atttcaacac aacccacagc tgcattcgcg 
421 aatacaccca aactttactg gcctgggcga aaacatttgg cttggctctc tatccctctt 
481 ctcagtacgt gcggccatcc tcgcctggtt tgaagaaagc cagtactatg acttcagcac 
541 tgggaaatgt aaaaaagtct gtgggcatta cactcagatt gtttgggcag atagttacaa 
601 gattggctgt gcagtgcaac tctgccccag aggagcaaat tttatttgca actatggacc 
661 agcagggaat tacccaacgt ggccatataa gcagggagcc acttgcagtg cttgcccaaa 
721 ggatgacaag tgcctgaaca atctctgcac taacccacaa cgagatcagg tctcacgtca 
781 ctctgctgat tatccaaaat atctacgtaa cagatacaca tcgctctatc tcatcgctaa
841 gtcagttctc ctattactgt ctgtcataat taccatttgg gtaaagcaca aatatcctaa 
901 cttagttctt ttggactaat gccctggtta ggggacaact tattcatatg tggtgatttt 
961 aaaatgtttc aataaaatct tagaagagtt 

SEQ ID NO: 122

Amino acid sequence of rat GLIPR1 encoded by the DNA sequence shown in SEQ ID NO:
121. 

hK?VLLAVMVWMASSASGFSYTASTLP 

DPKLAQIAKAWAQSCVFQHNPQLHSRIHPNFTGLGENIWLGSLSLF^ 
YYDFSTGKCKKVCGHYTQIVWADSYKIGCAVQLCPRGANFICNYGPAGNYPTWPYKQGAT 
CSACPKDDKCLNNLCTNPQRDQVSRHSADYPKYLRNRYTSLYLI AKSVLLLLSVI ITIWV
KHKYPNLVLLD 

SEQ ID NO: 123

gi|7661847|ref|NM_014879.1| Homo sapiens G protein-coupled receptor 105 (GPR105),
mRNA 

1 gaacagtgtt accttggagc ctacaatgag aggtatttca aaatgagtga agcatgactc 
61 tcacagatga aggcctagac gcaggatctt taatggaaaa acacttgggc cacttcaaga 
121 cgacaaacgc tcactgggca aaacaccttc actgaaaaga gacctcatat tatgcaaaaa 
181 aaatcttaag aggcctctgc cttcagaagt tacaagatga tcaattcaac ctccacacag
241 cctccagatg aatcctgctc tcagaacctc ctgatcactc agcagatcat tcctgtgctg 
301 tactgtatgg tcttcattgc gggaatccta ctcaatggag tgtcaggatg gatattcttt 
361 tacgtgccca gctctaagag tttcatcatc tatctcaaga acattgttat tgctgacttt 
421 gtgatgagcc tgacttttcc tttcaagatc cttggtgact caggccttgg tccctggcag 
481 ctgaacgtgt ttgtgtgcag ggtctctgcc gtgctcttct acgtcaacat gtacgtcagc 
541 attgtgttct ttgggctcat cagctttgac aggtattata aaattgtaaa gcctctttgg 
601 acttctttca tccagtcagt gagttacagc aaacttctgt cagtgatagt atggatgctc 
661 atgctcctcc ttgctgttcc aaatattatt ctcaccaacc agagtgttag ggaggttaca 
721 caaataaaat gtatagaact gaaaagtgaa ctgggacgga agtggcacaa agcatcaaac 
781 tacatcttcg tggccatctt ctggattgtg tttcttttgt taatcgtttt ctatactgct
841 atcacaaaga aaatctttaa gtcccacctt aagtcaagtc ggaattccac ttcggtcaaa
901 aagaaatcta gccgcaacat attcagcatc gtgtttgtgt tttttgtctg ttttgtacct 
961 taccatattg ccagaatccc ctacacaaag agtcagaccg aagctcatta cagctgccag 
1021 tcaaaagaaa tcttgcggta tatgaaagaa ttcactctgc tactatctgc tgcaaatgta 
1081 tgcttggacc ctattattta tttctttcta tgccagccgt ttagggaaat cttatgtaag 
1141 aaattgcaca ttccattaaa agctcagaat gacctagaca tttccagaat caaaagagga 
1201 aatacaacac ttgaaagcac agatactttg tgagttccta ccctcttcca aagaaagacc 
1261 acgtgtgcat gttgtcatct tcaattacat aacagaaatc aataagatat gtgccctcat 
1321 cataaatatc atctctagca ctgccatcca atttagttca ataaaattca aatataagtt 
1381 tccatgcttt tttgtaacat caaagaaaac atacccatca gtaatttctc taatactgac 
1441 ctttctattc tctattaata aaaaattaat acatacaatt attcaattct attatattaa 
1501 aataagttaa agtttataac cactagtctg gtcagttaat gtagaaattt aaatagtaaa 
1561 taaaacacaa cataatcaaa gacaactcac tcaggcatct tctttctcta aataccagaa 
1621 tctagtatgt aattgttttc aacactgtcc ttaaagacta acttgaaagc aggcacagtt 
1681 tgatgaaggg ctagagagct gtttgcaata aaaagtcagg tttttttcct gatttgaaga 
1741 agcaggaaaa gctgacaccc agacaatcac ttaagaaacc ccttattgat gtatttcatg 
1801 gcactgcaaa ggaagaggaa tattaattgt atacttagca agaaaatttt ttttttctga
1861 tagcactttg aggatattag atacatgcta aatatgtttt ctacaaagac ttacgtcatt 
1921 taatgagcct ggggttctgg tgttagaata tttttaagta ggctttactg agagaaacta 
1981 aatattggca tacgttatca gcaacttccc ctgttcaata gtatgggaaa aataagatga 
2041 ctgggaaaaa gacacaccca caccgtagaa catatattaa tctactggcg aatgggaaag 
2101 gagaccattt tcttagaaag caaataaact tgattttttt aaatctaaaa tttacattaa 
2161 tgagtgcaaa ataacacata aaatgaaaat tcacacatca catttttctg gaaaacagac 
2221 ggattttact tctggagaca tggcatacgg ttactgactt atgagctacc aaaactaaat

2281 tctttctctg ctattaactg gctagaagac attcatctat ttttcaaatg ttctttcaaa 

2341 acatttttat aagtaatgtt tgtatctatt tcatgcttta ctgtctatat actaataaag 

2401 aaatgtttta atactg 

SEQ ID NO: 124

Amino acid sequence of human GPR105 encoded by the DNA sequence shown in SEQ ID 
NO: 123. 

MINSTSTQPPDESCSQNLLITQQIIPVLYCMVFIAaiLLNGVSGWIPPYVPSSKSPIIYIi 
KNIVIADFVMSLTFPFKILGDSGLGPWQLNVFVCRVSAVLFYVNMYVSIVFFGLISFDRY 
YKI VKPLWTS F 1QSVS YS KLLS VI VWMLMLLLAVPN 1 1 LTNQS VREVTQ I KC I EL KS ELG 
RKWHKASNYIFVAIFWIWLLLIVFYTAITKKIPKSHLKSSRMSTSVKKKSSRNIFSIVF 
VFFVCFVPYHIARIPYTKSQTEAHYSCQSKEILRYMKEFTLLLSAANVCXOPIIYFFLCQ 
PFREILCKKLHIPLKAQNDLDISRIKRGNTTI.ESTDTL 

SEQ ID NO: 125

gi|34328344|ref|NM_133200.2| Mus musculus G protein-coupled receptor 105 (Gpr105),
mRNA 

1 aattcggatc catggaaggc cgccacccca gcagactgaa gccagacgtg aaggagttca 
61 tgtaagggag tccctgctgt cctccagaca cactgatgcc tgggctacgg atggggacgg 
121 ggacgcaatg tgtctggaat tctctcttcc gaatcctgga ttctgttgac gaagcttgcc 
181 tttgagattc ctgaacacgg agaaatagag attaaaaacc ccagaagaga gaaagtaaat 
241 gattcacaat cttgatgggt tttgccgtat ttatgttctt ccactgtatt agataccagt 
301 cacaaatgac ttagaggcca taaactgtgc tttaagtaac tagcctgcct ttctatccag 
361 atctttgcct ccagaggtga gaagatgaac aactccacca ccacagaccc tccaaaccag 
421 ccctgctcct ggaacaccct gatcacaaag cagatcattc ccgtgttgta cggtatggtc 
481 ttcatcacgg ggctcctcct caatgggata tcaggatgga tattctttta tgtgcccagc 
541 tccaagagtt ttatcatcta tctcaagaac atagtggtgg ctgactttct catgggcctg 
601 actttccctt tcaaagtcct tggtgactca ggcctcggcc cctggcaggt gaatgtgttt 
661 gtgtgcaggg tctctgccgt catcttctat gttaatatgt acgtcagcat cgtgttcttt 
721 gggctcatca gctttgacag gtactataaa attgtgaagc cccttctgac gtctattgtg 
781 cagtcggtga actatagcaa gctgctttct gtgctcgtgt ggatgctcat gcttctcctt 
841 gctgtcccaa acatcatcct gacaaaccag ggtgtcaagg aggtcacgaa gatacagtgc 
901 atggagctca aaaacgagct ggggcggaag tggcacaagg cgtctaacta tatcttcgtg 
961 agtatcttct gggtcgtgtt tcttctgcta atcgtcttct acacggccat cacgaggaag 
1021 atcttcaagt ctcacctcaa gtccaggaag aattccacct ccgtcaagag gaagtccagc 
1081 cgcaatatct tcagcatcgt gctcgttttt gtcgtctgct ttgtgcctta ccacattgcc 
1141 agaatcccct acacaaagag tcagacggaa ggtcactaca gctgccggac gaaggagacc 
1201 ctgctctatg cgaaagaatt cactctgcta ctctcggctg ccaatgtgtg tctggacccc 
1261 attatttatt tcttcttatg ccagccattt agagaagtct taaataagaa gttacacatg 
1321 tcactcaaag tccagaatga cctagaggtt tccaaaacca aaagggaaaa tgcgattcat 
1381 gaaagcacag atactttgta aattcccatc cccttccaag tattatcagt cttgttacat 
1441 gataattaag atacatgaat aaaaagcagg catatgatga taagtaactt agctagcaat 
1501 atatctaata atatgtatga agtccaaaaa ggtataataa aaataaaata taagtttcca 
1561 tgcaaaatgg aagtntgtag cacatcacat ttttttagaa atcaaaggaa cagagaagtg 
1621 gctttgtggg tgctggcgta tgagttacca aaaccaaact tctcttctat taactggctt 
1681 cttagaagac acccagtctt tccgaccttc ctcctaagca ttcttccaag caacactcgt 
1741 atctatttca tgctttgcac tatgcatgtg ccaataaaca agttgtcttc aaaacccaaa 
1801 aaaaaaaaaa aaaaaaaaaa agggcggccg caagcttatg tatttaa 

SEQ ID NO: 126 
Amino acid sequence of mouse GPR105 encoded by the DNA sequence shown in SEQ ID
NO: 125. 

MNNSTTTDPPNQPCSWNTLITKQIIPVLYGMVFITGLLLNGISGWIFFYVPSSKSFIIYL
KNIVVADFLMGLTFPFKVLGDSGLGPWQVNVFVCRVSAVIFYVNMYVSIVFFGLISFDRY
YKIVKPLLTSIVQSVNYSKLLSVLVWMLMLLLAVPNIILTNQGVKEVTKIQCMELKNELG
RKWHKASNYIFVSIFWVVFLLLIVFYTAITRKIFKSHLKSRKNSTSVKRKSSRNIFSIVL
VFVVCFVPYHIARIPYTKSQTEGHYSCRTKETLLYAKEFTLLLSAANVCLDPIIYFFLCQ
PFREVLNKKLHMSLKVQNDLEVSKTKRENAIHESTDTL

SEQ ID NO: 127 

gi|25742688|ref|NM_133577.1| Rattus norvegicus G protein-coupled receptor 105 (Gpr105),
mRNA 

1 aattctctct tccgcaccct gggttctgtg gatgaaactt gcctctgaga atcctgaaca 
61 tggagaaata gagataaata ccccagaaga gggggagtaa gtgttcacaa ccttggtggg 
121 tttcgcctca tgtctgtccc tccactttaa gagatgccgg tcacctgagg gccacaaact 
181 gcgctctaag taaccagcct gcctttctac ccagatcttt gtctccagaa gtgagaagat
241 ggacaacaca acaaccacag aacctccaaa gcagccctgc acccggaaca ccctgatcac 
301 acagcagatc atccccatgt tgtactgtgt ggtcttcatc acaggggtcc tcctcaacgg 
361 aatatcggga tggatattct tttacgtgcc cagctctaag agttttatca tctatctcaa 
421 gaacatagtt gtggctgact ttctcatggg cctcactttc cctttcaaag tcctcagcga 
481 ctcaggcctc ggtccctggc agctgaatgt gtttgtattc agggtgtctg ccgtgatctt
541 ctacgtcaac atgtacgtca gcatcgcgtt cttcgggctc atcagctttg acaggtacta 
601 taaaatcgtg aagccccttc tggtgtctat cgtccagtca gtgaactaca gcaaagtgct 
661 gtccgtgctt gtgtgggtgc tcatgcttct cctcgctgtc cccaacatca ttctgacaaa 
721 ccagagtgtc aaggatgtca ctaacataca gtgcatggaa ctcaaaaacg agctggggcg 
781 gaagtggcac aaggcgtcta actatgtctt cgtgagcatt ttctggatcg tgttccttct
841 gctgaccgtc ttctacatgg ccataacgag gaagatcttc aagtctcacc tcaagtccag 
901 aaagaattcc atctccgtca aaaggaagtc cagccgcaat atattcagca ttgtgctcgc 
961 atttgtcgcc tgtttcgcgc cttaccacgt tgccaggatc ccctacacca agagccagac 
1021 ggaaggacac tacagctgcc aggccaagga gaccctgctc tatacgaaag aattcaccct 
1081 gctgctctcg gctgccaatg tgtgtctgga ccccatatct atttcttctt atgccagccg
1141 tttagagaag tcttgaataa gaagttaaga atgtcactaa cagtccagaa tgacctagag 
1201 acttccaaaa ccagaagggg aaatatgatt caggaaagca cagatacttt gcaattctca 
1261 ccctttccaa gtattatgga ccttgttaca cggtaattaa gatgtatgaa gtgaaaagca 
1321 gaaaagtata ataaaaatga aaggtaagtt cccatgtaaa gtgaaagtct acagcacgca 
1381 gaaaagtata ataaaaatga aaggtaagtt cccatgtaaa gtgaaagtct acagcacgtc
1441 acatcttttt agaaatcgaa tggagaagtg gcatgtgggt gctagtctgc gagttacccc 
1501 aaaactaaac tctctcttct attaacgggc ttcttagaag acacccaccc tttcagaagt 
1561 tcctcctgag tatccttcta agtaatgctc acctctgttt catgctttgt tctgtgtatg 
1621 cgccaataaa caagttgtct ttaaaacccc aaa 

SEQ ID NO: 128

Amino acid sequence of rat GPR105 encoded by the DNA sequence shown in SEQ ID NO: 
127. 

MDNTTTTEPPKQPCTRNTLITQQIIPMLYCVVFITGVLLNGISGWIFFYVPSSKSFIIYL
KNIVVADFLMGLTFPFKVLSDSGLGPWQLNVFVFRVSAVIFYVNMYVSIAFFGLISFDRY
YKIVKPLLVSIVQSVNYSKVLSVLVWVLMLLLAVPNIILTNQSVKDVTNIQCMELKNELG
RKWHKASNYVFVSIFWIVFLLLTVFYMAITRKIFKSHLKSRKNSISVKRKSSRNIFSIVL
AFVACFAPYHVARIPYTKSQTEGHYSCQAKETLLYTKEFTLLLSAANVCLDPISISSYAS
RLEKS

SEQ ID NO: 129 
gi|19923974|ref|NM_138445.1| Homo sapiens G protein-coupled receptor 146 (GPR146),
mRNA 

1 ggcacgaggc gccggccgcc atgtggagct gcagctggtt caacggcaca gggctggtgg 
61 aggagctgcc tgcctgccag gacctgcagc tggggctgtc actgttgtcg ctgctgggcc 
121 tggtggtggg cgtgccagtg ggcctgtgct acaacgccct gctggtgctg gccaacctac
181 acagcaaggc cagcatgacc atgccggacg tgtactttgt caacatggca gtggcaggcc 
241 tggtgctcag cgccctggcc cctgtgcacc tgctcggccc cccgagctcc cggtgggcgc 
301 tgtggagtgt gggcggcgaa gtccacgtgg cactgcagat ccccttcaat gtgtcctcac 
361 tggtggccat gtactccacc gccctgctga gcctcgacca ctacatcgag cgtgcactgc 
421 cgcggaccta catggccagc gtgtacaaca cgcggcacgt gtgcggcttc gtgtggggtg
481 gcgcgctgct gaccagcttc tcctcgctgc tcttctacat ctgcagccat gtgtccaccc 
541 gcgcgctaga gtgcgccaag atgcagaacg cagaagctgc cgacgccacg ctggtgttca 
601 tcggctacgt ggtgccagca ctggccaccc tctacgcgct ggtgctactc tcccgcgtcc 
661 gcagggagga cacgcccctg gaccgggaca cgggccggct ggagccctcg gcacacaggc 
721 tgctggtggc caccgtgtgc acgcagtttg ggctctggac gccacactat ctgatcctgc
781 tggggcacac ggtcatcatc tcgcgaggga agcccgtgga cgcacactac ctggggctac 
841 tgcactttgt gaaggatttc tccaaactcc tggccttctc cagcagcttt gtgacaccac 
901 ttctctaccg ctacatgaac cagagcttcc ccagcaagct ccaacggctg atgaaaaagc 
961 tgccctgcgg ggaccggcac tgctccccgg accacatggg ggtgcagcag gtgctggcgt 
1021 aggcggccca gccctcctgg ggagacgtga ctctggtgga cgcagagcac ttagttaccc
1081 tggacgctcc ccacatcctt ccagaaggag acgagctgct ggaagagaag caggaggggt 
1141 gtttttcttg aagtttcctt tttcccacaa atgccactct tgggccaagg ctgtggtccc 
1201 cgtggctggc atctggcttg agtctccccg aggcctgtgc gtctcccaaa cacgcagctc 
1261 aaggtccaca tctgcaaaag cctcctcgcc ttcagcctcc tcagcattca gtttgtcaat 
1321 gaagtgatga aagcttagag ccagtattta tactttgtgg ttaaaatact tgattccccc
1381 ttgtttgttt tacaaaaaca gatgtttcct agaaaaatga caaatagtaa aatgaacaaa 
1441 accctacgaa agaatggcaa cagccagggt ggccgggccc tgccagtggg cggcgtgtgc 
1501 tagcaaggcc tgccgggtgt gccgcagtca ccacagggtt ctgagaacat ttcacagaag 
1561 tgcctgagac gcggagacat ggctggtgtt aaatggagct attcaatagc agtgacgcgc 
1621 tctcctcagc caccaaatgt ccctgacacc ctccccagcc cccacagata acatcagctg
1681 aggttttttt cagtatgaac ctgtcctaaa tcaattcctc aaagtgtgca caaaactaaa 
1741 gaatataaat aaacaaaaga aaggtgaaaa aaaaaaaaaa aaaaa 

SEQ ID NO: 130

Amino acid sequence of human GPR146 encoded by the DNA sequence shown in SEQ ID 
NO: 129.

MWSCSWFNGTGLVEELPACQDLQLGLSLLSLLGLVVGVPVGLCYNALLVLANLHSKASMT
MPDVYFVNMAVAGLVLSALAPVHLLGPPSSRWALWSVGGEVHVALQIPFNVSSLVAMYST
ALLSLDHYIERALPRTYMASVYNTRHVCGFVWGGALLTSFSSLLFYICSHVSTRALECAK
MQNAEAADATLVFIGYVVPALATLYALVLLSRVRREDTPLDRDTGRLEPSAHRLLVATVC
TQFGLWTPHYLILLGHTVIISRGKPVDAHYLGLLHFVKDFSKLLAFSSSFVTPLLYRYMN
QSFPSKLQRLMKKLPCGDRHCSPDHMGVQQVLA

SEQ ID NO: 131 

gi|31981401|ref|NM_030258.2| Mus musculus cDNA sequence BC003323 (BC003323),
mRNA 

1 ggtacacagc ccccacggcg tgcgcgccgt gjagcfcccgct gcctcctgca agcccagctg
61 cccgcccgcg gccggtgcgc gggccgccgc ccagcgagcg ctgcgaagct ggggtgagct
121 gccatgtgga gctgtggccc actcaacagc acagcgtggg ctgaggagcc gctgtgccgg
181 aacctgcgcc tggggctgtg ggtcctctcg ctgctctacc tgggggcagg ggtccctgtg
241 agcttaggct ataatgctct tctggtgctg gccaacctgg ccagcaagaa caccatgacc
301 atgcccgacg tgtacttcgt gaacatggcc gtggcggggc tggtgctcac ggcactggca
361 cctgcgtacc tgttgggccc tgcccactcc aggtgggccc tgtggagcct cagcagtgag
421 gcccatgtga cactgctcat cctgttcaac gtggcttccc tggtgaccat gtactccact
481 gcactgctga gccttgacta ctacatcgag cgtgccctgc cacgcaccta catggccagt
541 gtgtacaaca cccggcacgt gtgtggcttc gtctggggag gggcggtgct caccagcttc
601 tcctccctgc tcttctacat ctgcagtcac gtgtcttcta gaatcgctga gtgtgcccgg
661 atgcagaaca cggaggcagc cgatgctatc cttgtgctca tcggctacgt ggtgccaggt
721 ctggctgtgt tgtatgccct ggcactcatc tcgagaatcg ggaaggaaga cacacccctg
781 gaccaggaca ccagcaggct ggacccctcg gtgcacaggc tgctggtggc caccgtgtgc
841 actcagtttg gcctctggac accttactac ttgagcctgg ggcacacagt gctgacgtca
901 cgggggagga ccgtggaggg gcattatctg ggcatcctac aggttgctaa ggacctggcc
961 aagttcctgg ccttctcaag cagttctgtg acaccactgc tctaccgtta catcaacaaa
1021 gccttccccg gcaagctccg gcggctgatg aagaagatgc actgcgggcg ccgccactgc
1081 tcccccgacc cctcggggat acagcaggtg atggcacagg cgtagctaac cctccctggg
1141 ctgaccacga ggaacactaa aactcaactg gacacatagc actttgttcc ccaggcacat
1201 ggagcttcca gcccaggaga cacgctgaag ccagagatgc acagcggggc atttctcttg
1261 atgtttcctg ttttccccta aaaggccaac cttggtctta tgctgcatgg tggaaagcag
1321 aggtgccctg tgtgaacagg ctgcttctga ctaaccttcc caccttctcc ctcaaggtgt
1381 gtgcctttct cccagggaca tcttcaaggt ccatgtcctc acaagcaggc tgaccttcgg
1441 cctccctggt atttgggctt tctaaatgaa gcgatgaaat ctaaagccag tatttatact
1501 tcatattgac atgatacttg attctcctat tattattttt ttaaaaatag aagtgttaga
1561 aagataccta gtattaagaa gaaggcactt caggggaatg tggctgtcgc tgataccagt
1621 gttagagctt gaggacagga cagtgggcct gtcaggcctg cagcagctac acagccattg
1681 ctgacagtga cagctatgga gcactggccc tctcctctac tgagagagga tctttccagt
1741 gcttcaacat atgcatat

SEQ ID NO: 132

Amino acid sequence of mouse GPR146 encoded by the DNA sequence shown in SEQ ID
NO: 131.

MWSCGPLNSTAWAEEPLCRNLRLGLWVLSLLYLGAGVPVSLGYNALLVLANLASKNTMTM
PDVYFVNMAVAGLVLTALAPAYLLGPAHSRWALWSLSSEAHVTLLILFNVASLVTMYSTA
LLSLDYYIERALPRTYMASVYNTRHVCGFVWGGAVLTSFSSLLFYICSHVSSRIAECARM
QNTEAADAILVLIGYVVPGLAVLYALALISRIGKEDTPLDQDTSRLDPSVHRLLVATVCT
QFGLWTPYYLSLGHTVLTSRGRTVEGHYLGILQVAKDLAKFLAFSSSSVTPLLYRYINKA
FPGKLRRLMKKMHCGRRHCSPDPSGIQQVMAQA

SEQ ID NO: 133

ENSRNOT00000001733 cDNA sequence, EnsEMBL transcript [Rattus norvegicus]

1 atgtggagct gtggcccact caacagcaca gcgtgggctg aggagccgct gtgccggaac
61 ctgcgcctgg ggctgtgggt cctctcactg ttctacctgg gggcaggtgt ccctgtgggc
121 ttaggctaca atgctctttt ggtgctggcc aatctggcca gcaagaacag catgaccatg
181 cctgatgtgt acttcgtgaa catggctgtg gcggggctgg tgctcacagc actggcacct
241 gcgtacctgc tgggtcctgc ccactccagg tgggccctgt ggagcctcag cagcgaggcc
301 catgtgacac tgctcatcct gttcaacgtg gcttccctgg tgaccatgta ctccactgca
361 ctgctgagtc ttgactacta catcgagcgt gccctgccgc gtacctacat ggctagtgtg
421 tacaacaccc ggcacgtgtg tggcttcgtc tggggagggg cagtgctcac cagcttttcc
481 tccctgctct tctatatctg cagtcatgtg tcttctagaa ttgccgagtg tgcccggatg
541 cagaacacgg aggcagccga cgccatcctt gtgctcattg gctacgtggt gccaggtctg
601 gctgtgttgt atgccctggc actcatctca aggattggga aggaagacac acccctggac
661 caggacacca gcaggctgga cccctcagtg cacaggctgc tggtggccac tgtgtgcaca
721 cagtttggcc tctggacacc ttactacctg agcctggggc apacagtgct agtgtcacgg
781 ggaaggaccg tggaggggca ttatctgggc atcctacagg ttgctaagga cctggcaaag
841 ttcttggcct tctcaagcag ttctgtgacg ccgctgctct accgttacat caacaaagcc
901 ttccccagca agctccggcg cctggtgaaa aagatacact gtgggcgccg ccactgctcc
961 cccgacccte cggggataca gcaggtgatg gcccaggcgt ag

SEQ ID NO: 134

Amino acid sequence of rat GPR146 encoded by the DNA sequence shown in SEQ ID NO:
133. 

MWSCGPLNSTAWAEEPLCRNLRLGLWVLSLFYLGAGVPVGLGYNALLVLANLASKNSMTM
PDVYFVNMAVAGLVLTALAPAYLLGPAHSRWALWSLSSEAHVTLLILFNVASLVTMYSTA
LLSLDYYIERALPRTYMASVYNTRHVCGFVWGGAVLTSFSSLLFYICSHVSSRIAECARM
QNTEAADAILVLIGYVVPGLAVLYALALISRIGKEDTPLDQDTSRLDPSVHRLLVATVCT
QFGLWTPYYLSLGHTVLVSRGRTVEGHYLGILQVAKDLAKFLAFSSSSVTPLLYRYINKA
FPSKLRRLVKKIHCGRRHCSPDPAGIQQVMAQA

SEQ ID NO: 135

gi|4504090|ref|NM_001505.1| Homo sapiens G protein-coupled receptor 30 (GPR30), mRNA






1 ggaaaacgac acctagaagt aggagtgaga ttcgctgaag ttcccttctg aggaagaccc 
61 acccctccgc ctggagagcc ggggctggcg gtgcctgagg accccttcgg cctggacagc 
121 ccacgcgggc ttggggggcc tcgctctgcc ctcatggggc ggccatcggt tcccgaagcg 
181 gcgagtgaaa attcaaatgg ccagtagggg gcgcactcgg aagtggccgc cccgcatgag 
241 gcagttcagc ggccccgaga gtccggggag ggaggtttat tctccgcctg cacgagactg 
301 tgaaatccgc aaccatgagc aggagaggcg gccctggtgg ggaagaggcc accaacatct 
361 ggacggcagg tacccagaga gtgagcagct ccacgcggga ctgtgcacgg tggccgacac 
421 ccgcagggac gcccgccgga cgagcacgcg gagggccctc gcctccacgg atgcaccatg 
481 ccggtgtgag gagcatctgt tcttcccact ctctgcagtt aacaaaccca acccaaacca 
541 ccacaggtgc tcctcctggg gagtttcctg tctgacaaat gccaggctca cttcaaggag 
601 aatcacgctt ctttctaaag atggattcac catttaaaac agagctctgg gagcctttcg 
661 gcaaatcttg aaagctgcac ggcgcagaga catggatgtg acttcccaag cccggggcgt 
721 gggcctggag atgtacccag gcaccgcgca gcctgcggcc cccaacacca cctcccccga 
781 gctcaacctg tcccacccgc tcctgggcac cgccctggcc aatgggacag gtgagctctc 
841 ggagcaccag cagtacgtga tcggcctgtt cctctcgtgc ctctacacca tcttcctctt 
901 ccccatcggc tttgtgggca acatcctgat cctggtggtg aacatcagct tccgcgagaa 
961 gatgaccatc cccgacctgt acttcatcaa cctggcggtg gcggacctca tcctggtggc 
1021 cgactccctc attgaggtgt tcaacctgca cgagcggtac tacgacatcg ccgtcctgtg 
1081 caccttcatg tcgctcttcc tgcaggtcaa catgtacagc agcgtcttct tcctcacctg 
1141 gatgagcttc gaccgctaca tcgccctggc cagggccatg cgctgcagcc tgttccgcac 
1201 caagcaccac gcccggctga gctgtggcct catctggatg gcatccgtgt cagccacgct 
1261 ggtgcccttc accgccgtgc acctgcagca caccgacgag gcctgcttct gtttcgcgga 
1321 tgtccgggag gtgcagtggc tcgaggtcac gctgggcttc atcgtgccct tcgccatcat 
1381 cggcctgtgc tactccctca ttgtccgggt gctggtcagg gcgcaccggc accgtgggct
1441 gcggccccgg cggcagaagg cgctccgcat gatcctcgcg gtggtgctgg tcttcttcgt 
1501 ctgctggctg ccggagaacg tcttcatcag cgtgcacctc ctgcagcgga cgcagcctgg 
1561 ggccgctccc tgcaagcagt ctttccgcca tgcccacccc ctcacgggcc acattgtcaa 
1621 cctcgccgcc ttctccaaca gctgcctaaa ccccctcatc tacagctttc tcggggagac 
1681 cttcagggac aagctgaggc tgtacattga gcagaaaaca aatttgccgg ccctgaaccg 
1741 cttctgtcac gctgccctga aggccgtcat tccagacagc accgagcagt cggatgtgag 
1801 gttcagcagt gccgtgtaga cagccttggc cgcataggcc cagccagggt gtgactcggg 
1861 agctgcacac acctgggtgg acacaaggca cggccacgtc atgtctctaa actgcggtca 
1921 gatgtggctt ctggctcctc ggggcctcgc gagggtcacg cttgcctggt caccctgggg 
1981 ctgcttagga aacctcacga ctggtcacct tgcactcttc acacagaatt gctacaatcc 
2041 caaagcgctc gccccgcagg gtccaaaggc cagcggtgac cagcccgtca cccagctcct 
2101 ccccgccaac cctgcctgcc gctgcacctg cctgccgctg caggaaacat ttgacaccgt 
2161 cgaccaggaa agccacacgg agaggccact gtgggtgaag cgcctcagtt acacaggaac 
2221 cctaaagcaa atctgccacc gtgggggaac tgacgctgga gatgcaaggt gctggtgggt 
2281 ctgagctgga cgtcgcggtg tgtcctctgt gcccacggtc tgagctagct agcgcaccgc 
2341 cgagttaaag aggagaagga aaacatgctg ctctggtgca cgcctgagcg tcctccatct 
2401 tccaggatgg cagcaatggc gctgtgcggc ctcaccaggc ccacgaggag cagcagcgct 
2461 cggcccggag cagcaggaag gcccctctgt ggagcgcccg ccgtctgctc cggggtggtt 
2521 cagtcactgc ttgttgacat caacatggca attgcactca tgtggactgg gaccgtgcga 
2581 gctgccgtgt gggttagtcg ggtgccagga caatgaaata ctccagcacg tgtggctgac
2641 gaatttgttt ctacagaaat aacagctggg gacaactgcg gtgatgatgt aaaaaccttc 
2701 ccataaaatg taagaaaagc tgatgaggct ggtgacgttc agcctttgtc aataaacctg 
2761 tcatgtgcgg atcctt 

SEQ ID NO: 136

Amino acid sequence of human GPR30 encoded by the DNA sequence shown in SEQ ID 
NO: 135. 

MDVTSQARGVGLEMYPGTAQPAAPNTTSPELNLSHPLLGTALANGTGELSEHQQYVIGLF
LSCLYTIFLFPIGFVGNILILVVNISFREKMTIPDLYFINLAVADLILVADSLIEVFNLH
ERYYDIAVLCTFMSLFLQVNMYSSVFFLTWMSFDRYIALARAMRCSLFRTKHHARLSCGL
IWMASVSATLVPFTAVHLQHTDEACFCFADVREVQWLEVTLGFIVPFAIIGLCYSLIVRV
LVRAHRHRGLRPRRQKALRMILAVVLVFFVCWLPENVFISVHLLQRTQPGAAPCKQSFRH
AHPLTGHIVNLAAFSNSCLNPLIYSFLGETFRDKLRLYIEQKTNLPALNRFCHAALKAVI
PDSTEQSDVRFSSAV

SEQ ID NO: 137

gi|38081363|ref|XM_355659.1| Mus musculus G protein-coupled receptor 30 (Gpr30),
mRNA 



1 ataaaggagg cgctgtgcca agggggccag acgctgctgg acggccacag gcatccatcc
61 ccaggcatcg ggcgggtgct tctgttcctc tcctgctggg tccctgctgg gcaccgtccc 
121 caaagtgctg caagtccagg gtccatccct ggagcaagct ccaggagcac ctccagcaga
181 tggcctggta acgccacggc acagatcagg acacccaaca gaaaatcaga aggacactaa 
241 gtctgatcgt tagattaaca gagcagcgat ctggaccaag gacagaagcc agggtgtcat 
301 ttctgccatg cacccaccaa aacagctgat cagatctagg gagaaagcca tccaaggact 
361 ctgctcccct taagctgctg gaattgtggc caagcctcaa cactcacaca ctctgggtgc 
421 ccagaaggtg agcaggcagc aggtgtgcct gcccagcacc agcccagaca tcagacaccc
481 tgtccaccct tctggttttc tgagactaac aggctcccag gacgattctt cctgcctcac 
541 aaatgcctgg tttatctttc tttgtgaaga tggagctgtc acataaaaca gctttctgtg 
601 accctttcag caaatcctga aaactgccga gggaagccat ggatgcgact actccagccc 
661 aaactgttgg ggtggagatc tacctaggtc ccgtgtggcc agccccttcc aacagcaccc 
721 ctctggccct caacttgtcc ctggcactgc gggaagatgc cccggggaac ctcactgggg
781 acctctctga gcatcagcag tacgtgattg ccctcttcct ctcctgcctc tacaccatct 
841 tcctctttcc tattggcttt gtgggcaaca tcctcatcct ggtggtgaac atcagcttcc 
901 gggagaagat gaccatccca gacctgtact tcatcaacct ggcggcggcc gacctcatcc 
961 tggtggctga ctccctgatt gaggtgttca acctggacga gcagtactac gacatcgcag 
1021 tgctctgcac cttcatgtcc ctcttcctgc agatcaacat gtacagcagc gtcttcttcc
1081 tcacctggat gagcttcgac aggtacctag cgctggccaa ggccatgcgc tgtggcctct 
1141 tccgcaccaa gcaccacgca cggctcagct gtggcctcat ctggatggcc tcagtgtccg 
1201 ccacgctggt gcccttcaca gcggtgcacc tgcggcacac ggaggaggcc tgcttctgct 
1261 ttgctgatgt cagggaggtg cagtggctgg aggtcacact gggcttcatc atgcccttcg 
1321 ccatcattgg cctctgctac tccctcatcg tgcgagccct catccgggcc cacaggcacc
1381 gcggcctgcg cccacgcagg cagaaagccc tgaggatgat cttcgcagtg gtccttgttt 
1441 tcttcatctg ctggctgccg gagaacgtct tcatcagtgt ccacctactg cagtggacgc 
1501 agccagggga cactccctgc aagcagtctt tccgtcacgc ctaccccttg acaggccaca 
1561 tagtcaacct tgcagccttc tccaacagct gcctgaatcc cctcatctac agcttcctgg 
1621 gagagacctt cagggacaag ctcaggctct atgtggagca gaagacgagc ctgccggctc
1681 tgaaccgctt ctgccatgcc acgctcaagg ccgtcattcc agacagcaca gagcagtcag 
1741 aggtcaggtt cagcagtgct gtgtgagagg aaaaggtcag gggcgcaggc tggtgctcag
1801 gacttgcaca cacctagcac aggtggtgag tgggctaagc tatgtcatac tctcaaaccc 
1861 cagtggcttg gggaagacgt cacattgcgg ggtcatctct ggagctgctg gcatccttcc 
1921 tgactgtcca gctcatggat gctgccatcc agattcaagg tcccaaggca gcgggccacg
1981 tgacattgac ctctgacctc aaagggcacc aggcgggcct gctgcttggc tttctttcca 
2041 tagcctacgt tcccagaaca caagtctgct gttgatacga ggacaggcca tgctatggga 
2101 gcaccatgtt acatgcctgc tacgtggagg agtctagaga cagactttat gtaccagacc
2161 caaactggct accttccctt tgcttgtgat gtgtaactga ccatgtatac accgtccagt 
2221 gcagccagag ccttcttcct gtcttccaga aggctgtgag gtcaccccag atgccactcc 
2281 taactcctga gtgaacagcg tgtctgactg agaaaggccc tttaacaaaa cgccttcctg
2341 ctctgggatg ctcctctcac aaagtttgtt tacaaaggtg tttgcccttc cgtgaaggtg
2401 gaaggagact gggtgctgct gtgcaggctg gtgggatgcc gccataagat gtgtggtaga 
2461 aggacttacc accacagaaa atcatactgg gaacagcgag ctgtaaatgg atctcattaa 
2521 aacgt 

SEQ ID NO: 138

Amino acid sequence of mouse GPR30 encoded by the DNA sequence shown in SEQ ID
NO: 137. 

MDATTPAQTVGVEIYLGPVWPAPSNSTPLALNLSLALREDAPGNLTGDLSEHQQYVIALF
LSCLYTIFLFPIGFVGNILILVVNISFREKMTIPDLYFINLAAADLILVADSLIEVFNLD
EQYYDIAVLCTFMSLFLQINMYSSVFFLTWMSFDRYLALAKAMRCGLFRTKHHARLSCGL
IWMASVSATLVPFTAVHLRHTEEACFCFADVREVQWLEVTLGFIMPFAIIGLCYSLIVRA
LIRAHRHRGLRPRRQKALRMIFAVVLVFFICWLPENVFISVHLLQWTQPGDTPCKQSFRH
AYPLTGHIVNLAAFSNSCLNPLIYSFLGETFRDKLRLYVEQKTSLPALNRFCHATLKAVI
PDSTEQSEVRFSSAV

SEQ ID NO: 139

gi|19424261|ref|NM_133573.1| Rattus norvegicus G protein-coupled receptor 30 (Gpr30),
mRNA 

1 ttctgtgacc ctttcagcaa gtcctgaaag cttctacggg aagccatggc tgcaactact 
61 ccagcacaag atgttggcgt agagatctac ctaggtcccg tgtggccagc cccttccaac 
121 agcacccctc tggccctcaa cctgtccctg gcgctgcggg aagatgcccc ggggaacctc 
181 actggggacc tctctgaaca tcagcaatat gtgatcgctc tcttcctctc ctgcctctac
241 accatcttcc tcttccccat cggctttgtg ggcaacatcc tcatcttggt ggtgaacatc 
301 agcttccggg agaagatgac tatcccagac ctgtacttca tcaacctggc agcggctgac 
361 ctcatcctgg tggccgactc cctgatcgag gtgttcaacc tggacgagca gtattacgat 
421 atcgccgtgc tctgcacctt catgtccctc ttcctgcaga tcaacatgta cagcagcgtc 
481 ttcttcctca cctggatgag cttcgacagg tacctggcgc tggccaaagc catgcgctgt
541 ggcctcttcc gcaccaagca ccacgcgcgg ctcagctgtg gcctcatctg gatggcctca 
601 gtgtccgcca cgctggtgcc cttcacggcc gtgcatctgc ggcacaccga ggaggcctgc 
661 ttctgctttg ccgatgtcag ggaggtgcag tggctggagg tcacgctggg cttcattgtg 
721 cccttcgcca tcatcggcct gtgctattcc ctcatcgtgc gggccctcat ccgggcccac 
781 aggcatcgtg gcctgcgccc acgcaggcag aaagccctga ggatgatctt cgcagtggtc
841 cttgtcttct tcatctgctg gctgccggag aacgtcttca tcagcgtcca cctactgcag 
901 tgggcgcagc caggggacac tccctgcaag cagtctttcc gtcatgccta ccccttgaca 
961 ggccacatag tcaacctggc agccttctcc aacagctgcc tgagtcccct catctatagc 
1021 ttcctgggag agaccttcag ggacaagctc aggctgtatg tggcgcagaa gacgagcctg
1081 ccagctctca accgcttctg ccatgccacg ctcaaggcag tcataccaga cagcacggag
1141 cagtcagatg tcaagttcag cagtgctgta tgagaggtac ctcctagagg aaaacggaca 
1201 ggggagcagg cgtgcccagg agctgcacac tctagcacag gtggtgggcg agctgagcca 
1261 tgtcatactc taaacccc 

SEQ ID NO: 140


Amino acid sequence of rat GPR30 encoded by the DNA sequence shown in SEQ ID NO:
139. 

MAATTPAQDVGVEIYLGPVWPAPSNSTPLALNLSLALREDAPGNLTGDLSEHQQYVIALF
LSCLYTIFLFPIGFVGNILILVVNISFREKMTIPDLYFINLAAADLILVADSLIEVFNLD
EQYYDIAVLCTFMSLFLQINMYSSVFFLTWMSFDRYLALAKAMRCGLFRTKHHARLSCGL
IWMASVSATLVPFTAVHLRHTEEACFCFADVREVQWLEVTLGFIVPFAIIGLCYSLIVRA
LIRAHRHRGLRPRRQKALRMIFAVVLVFFICWLPENVFISVHLLQWAQPGDTPCKQSFRH
AYPLTGHIVNLAAFSNSCLSPLIYSFLGETFRDKLRLYVAQKTSLPALNRFCHATLKAVI
PDSTEQSDVKFSSAV

SEQ ID NO: 141 

gi|33695103|ref|NM_003608.2| Homo sapiens G protein-coupled receptor 65 (GPR65), 
mRNA 

1 ttcttgactt gatgcaggca cagatttatc aagctcctca gtcaacaaac acatcaccgg 
61 aagaaacatg gaaggaaagg aattttaaaa ggaaatacca atctctgtgc aaacaaagcc
121 ttgtatattc atgtttgcac caatctactg tgagatttat gaagaaaaac aaattgcgga 
181 caactctcta tgtacactta caaatgcctc agttgatgct tgtgggctgt ttgtcagcgt 
241 tctgtgataa tgaacacatg gacttctgtt tattaaattc agttgacccc tttagccaat 
301 tgccaggagc ctggattttt acttccaact gctgatatct gtgtaaaaat tgatctacat 
361 ccacccttta aaagcattga tgaattaatt agaactttag acaacaaaga aaaattgaaa
421 aagaattctc agtaaaagcg aattcgatgt tcaaaacaaa ctacaaagag acaagacttc 
481 tctgtttact ttctaagaac taatataatt gctaccttaa aaaggaaaaa atgaacagca 
541 catgtattga agaacagcat gacctggatc actatttgtt tcccattgtt tacatctttg 
601 tgattatagt cagcattcca gccaatattg gatctctgtg tgtgtctttc ctgcaagcaa 
661 agaaggaaag tgaactagga atttacctct tcagtttgtc actatcagat ttactctatg
721 cattaactct ccctttatgg attgattata cttggaataa agacaactgg actttctctc 
781 ctgccttgtg caaagggagt gcttttctca tgtacatgaa tttttacagc agcacagcat 
841 tcctcacctg cattgccgtt gatcggtatt tggctgttgt ctaccctttg aagttttttt 
901 tcctaaggac aagaagattt gcactcatgg tcagcctgtc catctggata ttggaaacca 
961 tcttcaatgc tgtcatgttg tgggaagatg aaacagttgt tgaatattgc gatgccgaaa
1021 agtctaattt tactttatgc tatgacaaat accctttaga gaaatggcaa atcaacctca 
1081 acttgttcag gacgtgtaca ggctatgcaa tacctttggt caccatcctg atctgcaacc 
1141 ggaaagtcta ccaagctgtg cggcacaata aagccacgga aaacaaggaa aagaagagaa 
1201 tcataaaact acttgtcagc atcacagtta cttttgtctt atgctttact ccctttcatg 
1261 tgatgttgct gattcgctgc attttagagc atgctgtgaa cttcgaagac cacagcaatt
1321 ctgggaagcg aacttacaca atgtatagaa tcacggttgc attaacaagt ttaaattgtg 
1381 ttgctgatcc aattctgtac tgttttgtaa ccgaaacagg aagatatgat atgtggaata 
1441 tattaaaatt ctgcactggg aggcgtaata catcacaaag acaaagaaaa cgcatacttt 
1501 ctgtgtctac aaaagatact atggaattag aggtccttga gtagaaccaa ggatgttttg 
1561 aagggaaggg aagtttaagt tatgcattat tatatcatca agattacatt ttgaaaagga
1621 aatctagcat gtgaggggac taagtgttct cagagtgatg ttttaatcca gtccaataaa 
1681 aatatcttaa aactgcattg tacagctccc tccctgcgtt ttattaaatg atgtatatta 
1741 aacaaagatc aataaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 

SEQ ID NO: 142 

Amino acid sequence of human GPR65 encoded by the DNA sequence shown in SEQ ID
NO: 141. 

MNSTCIEEQHDLDHYLFPIVYIFVIIVSIPANIGSLCVSFLQAKKESELGIYLFSLSLSD
LLYALTLPLWIDYTWNKDNWTFSPALCKGSAFLMYMNFYSSTAFLTCIAVDRYLAVVYPL
KFFFLRTRRFALMVSLSIWILETIFNAVMLWEDETVVEYCDAEKSNFTLCYDKYPLEKWQ
INLNLFRTCTGYAIPLVTILICNRKVYQAVRHNKATENKEKKRIIKLLVSITVTFVLCFT
PFHVMLLIRCILEHAVNFEDHSNSGKRTYTMYRITVALTSLNCVADPILYCFVTETGRYD
MWNILKFCTGRCNTSQRQRKRILSVSTKDTMELEVLE

SEQ ID NO: 143
gi|1103872|gb|U39827.1|MMU39827 Mus musculus putative G protein-coupled receptor
TDAG8 (TDAG8) mRNA, complete cds 

1 gatacaagca gatttgccag cctcctcagt caagagaagc atccctccag aaacagggaa 
61 acatgacact tttgaaagaa tgccaaacgg cgtgaaaata aaaacagagc attcccattt 
121 gcaccgacca atctccaatc tcctgtaaga ttcaaaaggg caagcaagag gcggtgaccg
181 ttcacgaaag ctaaaatccc atgctattga acatgaagac ttctgatgct taaatctcat 
241 taactgcttt aagtcactcc caggagcttg gatcccaact tctagcagta atagtctgtg 
301 taaaaaaaaa aaaaaaatca gtctacaacc actctctaaa tgcatggatg aactcatcag 
361 aacatcaaaa cccaaggaaa ccctaagaga gaagaattct aataaaaaga attttacatt 
421 gaaaacttac aaggcaaggt ccctttccct gctgacagcc taagaagtga tgtaactgcc
481 actgtgaaga ccatggcgat gaacagcatg tgcattgaag agcagcgcca cctcgaacac 
541 tatttgttcc cggtggtcta cataattgtg tttatagtca gcgtcccagc caacatcgga 
601 tctttatgcg tatcctttct gcaagcgaag aaggaaaatg agctagggat ttacctcttc 
661 agtctgtccc tgtcagacct gctgtatgcg ctgactctgc ccctctggat caattacact 
721 tggaataaag acaactggac tttctctccc accttgtgca aaggaagcgt tttcttcacc
781 tacatgaact tttacagcag cacggcgttc ctcacttgca ttgccctgga ccgctattta 
841 gcagtcgtct accctctgaa gttttccttc ctaagaacga gaagattcgc gtttattacc 
901 agcctctcca tctggatatt agagtccttc tttaactcta tgcttctgtg gaaagatgaa 
961 acgagtgttg aatattgtga ctcggacaaa tctaatttca ctctctgcta tgacaaatac 
1021 cctctggaga aatggcagat aaacctcaac ctgtttcgga cgtgcatggg ctacgcaata
1081 cccttgatca ccatcatgat ctgcaaccat aaagtctacc gagctgtgcg gcacaaccaa 
1141 gccacggaaa acagcgagaa gagaaggatc ataaagttgc ttgctagcat cacgttgact 
1201 ttcgtcctat gctttacccc cttccacgtg atggtgctca tccgctgcgt tttagagcgc 
1261 gacatgaacg tcaatgacaa gtctggatgg cagacgttta cggtgtacag agtcacagta 
1321 gccctgacga gtctaaactg tgttgccgat cccattctgt actgctttgt gactgagacg
1381 gggagagctg atatgtggaa catattaaaa ttgtgtacta ggaaacacaa tagacaccaa 
1441 gggaaaaaaa gggacatact ttctgtgtcc acaagagatg ctgtagaatt agagattata 
1501 gactaagagg tggaggcagg ttaagttaca tggtattatt taatgaaact tacattttgg 
1561 aaaagaaatc tggcatagta gaacccagtg gaaatagttt gaaggtacat tgtatgactc 
1621 ctatgttggc tttattaagt aaggtataga aatgtattat cttgtatgta ttctaatgac
1681 taggcatcat tgttttagta ccaattctct ttgcctctat gttataaccc ctaagaagca 
1741 cgcgggactg ttcgtcttta aatcagtggc cattctatct gactactatg actttttgtt 
1801 gttgttctgc tttgggtttt cagtctgcct gcatcagtct tctcctctgt atacgtctgt 
1861 cttcaacaaa tgtaaggact aaatacccct cccgatcaca tccattatca aggatttgaa 
1921 gccactccat gtactgggtt ataaaagaaa tgttctcatg aactttcatg aagtttacat
1981 acctttgggg atctagtcac cgagtcacat aaagtaaaag taaatggatt ttgtan 



SEQ ID NO: 144

Amino acid sequence of mouse GPR65 encoded by the DNA sequence shown in SEQ ID
NO: 143.

MAMNSMCIEEQHHLEHYLFPVVYIIVFIVSVPANIGSLCVSFLQAKKENELGIYLFSLSL
SDLLYALTLPLWINYTWNKDNWTFSPTLCKGSVFFTYMNFYSSTAFLTCIALDRYLAVVY
PLKFSFLRTRRFAFITSLSIWILESFFNSMLLWKDETSVEYCDSDKSNFTLCYDKYPLEK
WQINLNLFRTCMGYAIPLITIMICNHKVYRAVRHNQATENSEKRRIIKLLASITLTFVLC
FTPFHVMVLIRCVLERDMNVNDKSGWQTFTVYRVTVALTSLNCVADPILYCFVTETGRAD
MWNILKLCTRKHNRHQGQKRDILSVSTRDAVELEIID

SEQ ID NO: 145

gi|27667219|ref|XM_234367.1| Rattus norvegicus similar to Gpcr25 protein (LOC299242),
mRNA 

1 atgacgatga acagcacatg tgttgaagaa cagcatgacc tggaccacta tttgttccca

61 gtggtctaca tatttgtgtt tatagtcagc gtcccagcca acatcggctc tttatgtgta 
121 tcttttctgc aagcgaagaa ggaaaatgaa ctaggagttt acctcttcag tctgtcactg
181 tcagatctgc tgtatgcgtt gacgctgcct ctctggataa actacacttg gaataaagac 
241 aactggacgc tctctcccac cttgtgcaaa ggaagcgttt tcttcaccta cctgaacttt 
301 tacagcagca cggccttcct cacttgcatt gccttggacc gctatttagc agtcgtctac 
361 cctctaaggt tttcttttct aagaacaaga agatttgcat ttatcaccag cctgtctatc
421 tggatattgg aatccgtctt taactccatc cttctgtggg aagacgaaac aagtgtcgaa 
481 tactgtgatg cagagaaatc taatttcact ctctgctatg acaaataccc tctggagaag 
541 tggcagataa accttaacct gttcaggact ttcctgggct acgcaatacc cttggtcacc 
601 atcatgatct gcaaccataa agtctaccgc gccgtgaagc acaaccaagc tacggaaaac 

661 aacgaaagga gacggatcat aaagctgctt gccagcatca cgctgacttt catcttatgc
721 tttacccctt tccacgtgat ggtgctcatc cgctgcattt tagagcgcaa cgtgaacttc 
781 aataacaaga ctggaaagca gacatttacg gtgtacagaa tcacagtggc cctgacgagt 
841 ctaaactgtg tcgccgatcc aattctctac tgtttcgtga cggaaaccgg gagatccgat 
901 atgtggaaca tattaaaagt gtgtgctagg aagcgcaata gatcccgagg acaaaaaaga 

961 gacatacttt ctgtgtccac aagagatacc atagaactgg agattataga ataa



SEQ ID NO: 146 

Amino acid sequence of rat GPR65 encoded by the DNA sequence shown in SEQ ID NO: 
145. 

MTMNSTCVEEQHDLDHYLFPVVYIFVFIVSVPANIGSLCVSFLQAKKENELGVYLFSLSL
SDLLYALTLPLWINYTWNKDNWTLSPTLCKGSVFFTYLNFYSSTAFLTCIALDRYLAVVY
PLRFSFLRTRRFAFITSLSIWILESVFNSILLWEDETSVEYCDAEKSNFTLCYDKYPLEK
WQINLNLFRTFLGYAIPLVTIMICNHKVYRAVKHNQATENNERRRIIKLLASITLTFILC
FTPFHVMVLIRCILERNVNFNNKTGKQTFTVYRITVALTSLNCVADPILYCFVTETGRSD
MWNILKVCARKRNRSRGQKRDILSVSTRDTIELEIIE

SEQ ID NO: 147 

gi|40254430|ref|NM_000867.2| Homo sapiens 5-hydroxytryptamine (serotonin) receptor 2B
(HTR2B), mRNA 

1 gggggtattt gtttcactgc tttcaaccgc ctgtgctgga ggctcagaat aagtcaatgg 
61 gaggaggatt tcagtcacag cagcaagcaa gtctagtgaa cagataagat gacatgctca
121 gcaaaataac aacgaaacca gagggggaac tctctggcat gcaagttcaa acacgactct 
181 acaactacgg cagaaaaaga gagagagaga aactaaaaat atatatatat cctatttttt 
241 tcacagctat cagtttcttt cactgagctt tcctaaattt aagcctctag aaaataataa 
301 atacttggat atcttaccta caaacatgga cagatgtgtg tatgcgctca ttttagagaa 
361 cttgaatttt tttttttaaa ggaaggtgtc aactttggct tttgagtgtt tggcatggtt
421 acaatgcctt aaaaaaacag atgagcagct tagctactaa ccatgctgac cactgttcgg 
481 aacgggattg aatcacagaa aaacagcaaa tggctctctc ttacagagtg tctgaacttc 
541 aaagcacaat tcctgagcac attttgcaga gcacctttgt tcacgttatc tcttctaact 
601 ggtctggatt acagacagaa tcaataccag aggaaatgaa acagattgtt gaggaacagg 
661 gaaataaact gcactgggca gctcttctga tactcatggt gataataccc acaattggtg
721 gaaataccct tgttattctg gctgtttcac tggagaagaa gctgcagtat gctactaatt 
781 actttctaat gtccttggcg gtggctgatt tgctggttgg attgtttgtg atgccaattg 
841 ccctcttgac aataatgttt gaggctatgt ggcccctccc acttgttcta tgtcctgcct 
901 ggttatttct tgacgttctc ttttcaaccg catccatcat gcatctctgt gccatttcag 
961 tggatcgtta catagccatc aaaaagccaa tccaggccaa tcaatataac tcacgggcta
1021 cagcattcat caagattaca gtggtgtggt taatttcaat aggcattgcc attccagtcc 
1081 ctattaaagg gatagagact gatgtggaca acccaaacaa tatcacttgt gtgctgacaa 
1141 aggaacgttt tggcgatttc atgctctttg gctcactggc tgccttcttc acacctcttg 
1201 caattatgat tgtcacctac tttctcacta tccatgcttt acagaagaag gcttacttag 
1261 tcaaaaacaa gccacctcaa cgcctaacat ggttgactgt gtctacagtt ttccaaaggg
1321 atgaaacacc ttgctcgtca ccggaaaagg tggcaatgct ggatggttct cgaaaggaca 
1381 aggctctgcc caactcaggt gatgaaacac ttatgcgaag aacatccaca attgggaaaa
1441 agtcagtgca gaccatttcc aacgaacaga gagcctcaaa ggtcctaggg attgtgtttt
1501 tcctcttttt gcttatgtgg tgtcccttct ttattacaaa tataacttta gttttatgtg 
1561 attcctgtaa ccaaactact ctccaaatgc tcctggagat atttgtgtgg ataggctatg 
1621 tttcctcagg agtgaatcct ttggtctaca ccctcttcaa taagacattt cgggatgcat 
1681 ttggccgata tatcacctgc aattaccggg ccacaaagtc agtaaaaact ctcagaaaac
1741 gctccagtaa gatctacttc cggaatccaa tggcagagaa ctctaagttt ttcaagaaac 
1801 atggaattcg aaatgggatt aaccctgcca tgtaccagag tccaatgagg ctccgaagtt 
1861 caaccattca gtcttcatca atcattctac tagatacgct tctcctcact gaaaatgaag 
1921 gtgacaaaac tgaagagcga gttagttatg tatagcagaa ctggcagttg tcatcaaaca 
1981 taatgatgag taagatgatg aatgagatgt aaatgtgcca agaatatatt atataaagaa
2041 ttttatgtca tatatcaaat catctcttta acctaagatg taagtattaa gaatatctaa 
2101 ttttcctaat ttggacaaga ttattccatg aggaaaataa ttttatatag ctacaaatga 
2161 aaacaatcca gcactctggt taaattttaa ggtattcgaa tgaaataaag tcaaatcaat 
2221 aaatttcagg ccaaaaaaaa aaaaaaaaaa aaaaaaaaaa 

SEQ ID NO: 148

Amino acid sequence of human HTR2B encoded by the DNA sequence shown in SEQ ID 
NO: 147. 

MALSYRVSELQSTIPEHILQSTFVHVISSNWSGLQTESIPEEMKQIVEEQGNKLHWAALL
ILMVIIPTIGGNTLVILAVSLEKKLQYATNYFLMSLAVADLLVGLFVMPIALLTIMFEAM
WPLPLVLCPAWLFLDVLFSTASIMHLCAISVDRYIAIKKPIQANQYNSRATAFIKITVVW
LISIGIAIPVPIKGIETDVDNPNNITCVLTKERFGDFMLFGSLAAFFTPLAIMIVTYFLT
IHALQKKAYLVKNKPPQRLTWLTVSTVFQRDETPCSSPEKVAMLDGSRKDKALPNSGDET
LMRRTSTIGKKSVQTISNEQRASKVLGIVFFLFLLMWCPFFITNITLVLCDSCNQTTLQM
LLEIFVWIGYVSSGVNPLVYTLFNKTFRDAFGRYITCNYRATKSVKTLRKRSSKIYFRNP
MAENSKFFKKHGIRNGINPAMYQSPMRLRSSTIQSSSIILLDTLLLTENEGDKTEERVSY
V

SEQ ID NO: 149 

gi|6680322|ref|NM_008311.1| Mus musculus 5-hydroxytryptamine (serotonin) receptor 2B
(Htr2b), mRNA

1 actgtctgga actggactga gtcaccaaaa ggcgaatggc ttcatcttat aaaatgtctg

61 aacaaagcac aacttctgag cacattttac agaagacatg tgatcacctg atcctgacta 
121 accgttctgg attagagaca gactcagtag cagaggaaat gaagcagact gtggagggac 
181 aggggcatac agtgcactgg gcagctctcc tgatactcgc ggtgataata cccaccattg
241 gtgggaacat ccttgtgatt ctggctgttg cactggagaa aaggctgcag tacgctacca 
301 actacttttt aatgtccttg gcgatagcag atttgctggt tggattgttt gtgatgccga
361 ttgccctctt gacaatcatg tttgaggcta tatggcccct cccactggcc ctgtgtcctg 
421 cctggttatt cctcgatgtt ctcttttcaa ctgcctccat catgcatctc tgtgccattt 
481 ccctggaccg ctatatagcc atcaaaaagc caattcaggc caatcagtgc aacacccggg
541 ctactgcatt catcaagatt acagtggtat ggttaatttc aataggcatc gccatcccag
601 tccctattaa aggaatcgag actgatgtga ttaatccaca caatgtcacc tgtgagctga
661 caaaggaccg ctttggcagt tttatggtct ttgggtcact ggctgctttc ttcgtacctc
721 tcaccatcat ggtagtcact tactttctca ccattcacac tttacagaag aaagcttact 
781 tggtcaaaaa taagccacct caacgcctaa cacggtggac tgtgcccaca gttttcctaa 
841 gggaagactc atccttttca tcaccagaaa aggtggcaat gctggatggg tctcacaggg 
901 ataaaattct acctaactca agtgatgaga cacttatgcg aagaatgtcc tcagttggaa
961 aaagatcagc ccaaaccatt tctaatgagc agagagcctc gaaggccctt ggagtcgtgt
1021 ttttcctttt tctgcttatg tggtgcccct ttctfcattac aaatctaact ttagctctgt
1081 gtgattcctg caatcagacc actctcaaaa cactcctgga gatatttgtg tggataggct 
1141 acgtttcctc gggggtgaat cctctgatct atacactctt caataagaca tttcgggaag 
1201 catttggcag gtacatcacc tgcaattacc gagccacaaa gtcagtaaaa gcacttagga
1261 agttttccag tacactttgt tttgggaatt caatggtaga aaactctaaa tttttcacaa
1321 aacatggaat tcgaaatggg atcaaccctg ccatgtacca gagcccaatg aggctccgat
1381 gttcaaccat tcagtcctca tcaatcatcc tcctcgatac ccttctcact gaaaacgatg
1441 gcgacaaagc ggaagagcag gtcagctaca tattgcagga acgggccggc ctcatcttga 
1501 gagagggtga tgagcaggac gcacgcgcac catggcaggt tcaagagtga 

SEQ ID NO: 150

Amino acid sequence of mouse HTR2B encoded by the DNA sequence shown in SEQ ID
NO: 149. 

MASSYKMSEQSTTSEHILQKTCDHLILTNRSGLETDSVAEEMKQTVEGQGHTVHWAALLI
LAVIIPTIGGNILVILAVALEKRLQYATNYFLMSLAIADLLVGLFVMPIALLTIMFEAIW
PLPLALCPAWLFLDVLFSTASIMHLCAISLDRYIAIKKPIQANQCNTRATAFIKITVVWL
ISIGIAIPVPIKGIETDVINPHNVTCELTKDRFGSFMVFGSLAAFFVPLTIMVVTYFLTI
HTLQKKAYLVKNKPPQRLTRWTVPTVFLREDSSFSSPEKVAMLDGSHRDKILPNSSDETL
MRRMSSVGKRSAQTISNEQRASKALGVVFFLFLLMWCPFFITNLTLALCDSCNQTTLKTL
LEIFVWIGYVSSGVNPLIYTLFNKTFREAFGRYITCNYRATKSVKALRKFSSTLCFGNSM
VENSKFFTKHGIRNGINPAMYQSPMRLRCSTIQSSSIILLDTLLTENDGDKAEEQVSYIL
QERAGLILREGDEQDARAPWQVQE

SEQ ID NO: 151 

gi|8393585|ref|NM_017250.1| Rattus norvegicus 5-hydroxytryptamine (serotonin) receptor
2B (Htr2b), mRNA 

1 ctgaaatcta agcctctaga aggactagaa tctggatgtc ttacctgcaa acatggacag 
61 atatgtacac agtcccatct tggagaacct gaatcttttt agaagaaaga aggccacctt
121 ggctgggagt gtctggagga taccatgctt tgcaaaagca gatgacctgc tagcaactga 
181 ccatgctgac cactgtctgg aactggactg agtcacagaa aggcgaatgg cttcatctta 
241 taaaatgtct gaacaaagca caatttctga gcacattttg cagaaaacat gtgatcacct 
301 gatcttgact gaccgttctg gattaaaggc agaatcagca gcagaggaaa tgaagcagac 
361 tgccgagaac caggggaata cagtgcactg ggcagctctc ctgatcttcg cggtaataat
421 ccccaccatt ggcgggaaca tcctggttat tctggctgtt tcactggaga aaaggctgca 
481 gtacgctacc aactactttc taatgtcctt ggcggtggct gatttgctgg ttggattgtt 
541 tgtgatgccg attgctctct taacaatcat gtttgaggct acatggcccc tcccactggc 
601 cctgtgtcct gcctggttat tccttgatgt tctcttttca actgcctcca tcatgcatct 
661 ctgtgccatt tccctggatc gctatatagc catcaaaaag ccaattcagg ccaatcagtg
721 caattcccgg actactgcat tcgtcaagat tacggtggta tggttaattt caataggcat 
781 cgccatccca gtccctatta aaggaataga ggctgatgtg gtcaacgcac acaacatcac 
841 ctgtgagctg acaaaggacc gctttggcag tttcatgctc tttgggtcac tggctgcttt 
901 ctttgcacct ctcaccatca tgatagtcac ctactttctc accattcacg ctttgcggaa 
961 gaaagcttac ttggtcagaa acaggccacc tcaacgccta acacggtgga ctgtgtccac
1021 agttctccaa agggaagact catccttttc atcaccagaa aagatggtga tgctggatgg 
1081 ctctcacaag gataaaattc tacctaactc aactgatgag acactgatga gaagaatgtc 
1141 ctcagcagga aaaaaaccag cccagaccat ttctaatgaa cagagagcct caaaggtcct 
1201 tggaattgtg tttctcttct ttctgcttat gtggtgcccc tttttcatta caaacgtaac 
1261 tttagctctg tgtgattcct gcaaccagac tactctcaaa acactcctgc agatatttgt
1321 gtgggtaggc tacgtttcct cgggagtgaa tcctttgatc tataccctct tcaataagac 
1381 atttcgggaa gcatttggca ggtacatcac ctgcaattac caggccacaa agtcagtaaa 
1441 agtgcttaga aagtgttcta gtacactcta ttttgggaat tcaatggtag aaaactctaa 
1501 atttttcaca aaacatggaa ttcgaaatgg gatcaaccct gccatgtacc agagcccagt
1561 aaggctccga agttcaacca ttcagtcttc atccatcatt ctcctcaata catttctcac
1621 tgaaaacgat ggtgacaaag tagaagacca agtcagctac atatagtgga atggggcagc 
1681 cctcatctga ctgagggagg ggatgaggag gacgcaagca aaccaaggaa aaggcaagag 
1741 tgaagcacta aggttgtcca gtttccttat ctaaacaaac tcaacgcacg ggtatagtag 
1801 ttccgtatgg ctacaaacaa aagcattccc tactctggta ttcaaatgga acaaaattaa 
1861 ataagtggat atacttcagt ccttaaaaag aaaagaaggg gttggggatt tagctcagtg
1921 gtagagagtt tgcctagtaa gtgcaaggcc ctgggttcgg ttctcagctc cagaaaaaaa 
1981 aaaaataaaa aaaaaaaata aataaaaaaa aaaaaaaaaa a

SEQ ID NO: 152

Amino acid sequence of rat HTR2B encoded by the DNA sequence shown in SEQ ID NO: 
151. 

MASSYKMSEQSTISEHILQKTCDHLILTDRSGLKAESAAEEMKQTAENQGNTVHWAALLI
FAVIIPTIGGNILVILAVSLEKRLQYATNYFLMSLAVADLLVGLFVMPIALLTIMFEATW
PLPLALCPAWLFLDVLFSTASIMHLCAISLDRYIAIKKPIQANQCNSRTTAFVKITVVWL
ISIGIAIPVPIKGIEADVVNAHNITCELTKDRFGSFMLFGSLAAFFAPLTIMIVTYFLTI
HALRKKAYLVRNRPPQRLTRWTVSTVLQREDSSFSSPEKMVMLDGSHKDKILPNSTDETL
MRRMSSAGKKPAQTISNEQRASKVLGIVFLFFLLMWCPFFITNVTLALCDSCNQTTLKTL
LQIFVWVGYVSSGVNPLIYTLFNKTFREAFGRYITCNYQATKSVKVLRKCSSTLYFGNSM
VENSKFFTKHGIRNGINPAMYQSPVRLRSSTIQSSSIILLNTFLTENDGDKVEDQVSYI

SEQ ID NO: 153 

gi|4557885|ref|NM_000211.1| Homo sapiens integrin, beta 2 (antigen CD18 (p95),
lymphocyte function-associated antigen 1; macrophage antigen 1 (mac-1) beta subunit)
(ITGB2), mRNA

1 cagggcagac tggtagcaaa gcccccacgc ccagccagga gcaccgccgc ggactccagc 
61 acaccgaggg acatgctggg cctgcgcccc ccactgctcg ccctggtggg gctgctctcc 
121 ctcgggtgcg tcctctctca ggagtgcacg aagttcaagg tcagcagctg ccgggaatgc 
181 atcgagtcgg ggcccggctg cacctggtgc cagaagctga acttcacagg gccgggggat 
241 cctgactcca ttcgctgcga cacccggcca cagctgctca tgaggggctg tgcggctgac
301 gacatcatgg accccacaag cctcgctgaa acccaggaag accacaatgg gggccagaag 
361 cagctgtccc cacaaaaagt gacgctttac ctgcgaccag gccaggcagc agcgttcaac 
421 gtgaccttcc ggcgggccaa gggctacccc atcgacctgt actatctgat ggacctctcc 
481 tactccatgc ttgatgacct caggaatgtc aagaagctag gtggcgacct gctccgggcc 
541 ctcaacgaga tcaccgagtc cggccgcatt ggcttcgggt ccttcgtgga caagaccgtg
601 ctgccgttcg tgaacacgca ccctgataag ctgcgaaacc catgccccaa caaggagaaa 
661 gagtgccagc ccccgtttgc cttcaggcac gtgctgaagc tgaccaacaa ctccaaccag 
721 tttcagaccg aggtcgggaa gcagctgatt tccggaaacc tggatgcacc cgagggtggg 
781 ctggacgcca tgatgcaggt cgccgcctgc ccggaggaaa tcggctggcg caacgtcacg 
841 cggctgctgg tgtttgccac tgatgacggc ttccatttcg cgggcgacgg aaagctgggc
901 gccatcctga cccccaacga cggccgctgt cacctggagg acaacttgta caagaggagc 
961 aacgaattcg actacccatc ggtgggccag ctggcgcaca agctggctga aaacaacatc 
1021 cagcccatct tcgcggtgac cagtaggatg gtgaagacct acgagaaact caccgagatc 
1081 atccccaagt cagccgtggg ggagctgtct gaggactcca gcaatgtggt ccatctcatt 
1141 aagaatgctt acaataaact ctcctccagg gtcttcctgg atcacaacgc cctccccgac
1201 accctgaaag tcacctacga ctccttctgc agcaatggag tgacgcacag gaaccagccc
1261 agaggtgact gtgatggcgt gcagatcaat gtcccgatca ccttccaggt gaaggtcacg 
1321 gccacagagt gcatccagga gcagtcgttt gtcatccggg cgctgggctt cacggacata 
1381 gtgaccgtgc aggttcttcc ccagtgtgag tgccggtgcc gggaccagag cagagaccgc 
1441 agcctctgcc atggcaaggg cttcttggag tgcggcatct gcaggtgtga cactggctac
1501 attgggaaaa actgtgagtg ccagacacag ggccggagca gccaggagct ggaaggaagc 
1561 tgccggaagg acaacaactc catcatctgc tcagggctgg gggactgtgt ctgcgggcag 
1621 tgcctgtgcc acaccagcga cgtccccggc aagctgatat acgggcagta ctgcgagtgt 
1681 gacaccatca actgtgagcg ctacaacggc caggtctgcg gcggcccggg gagggggctc 
1741 tgcttctgcg ggaagtgccg ctgccacccg ggctttgagg gctcagcgtg ccagtgcgag
1801 aggaccactg agggctgcct gaacccgcgg cgtgttgagt gtagtggtcg tggccggtgc 
1861 cgctgcaacg tatgcgagtg ccattcaggc taccagctgc ctctgtgcca ggagtgcccc 
1921 ggctgcccct caccctgtgg caagtacatc tcctgcgccg agtgcctgaa gttcgaaaag
1981 ggcccctttg ggaagaactg cagcgcggcg tgtccgggcc tgcagctgtc gaacaacccc 
2041 gtgaagggca ggacctgcaa ggagagggac tcagagggct gctgggtggc ctacacgctg
2101 gagcagcagg acgggatgga ccgctacctc atctatgtgg atgagagccg agagtgtgtg 
2161 gcaggcccca acatcgccgc catcgtcggg ggcaccgtgg caggcatcgt gctgatcggc 
2221 attctcctgc tggtcatctg gaaggctctg atccacctga gcgacctccg ggagtacagg
2281 cgctttgaga aggagaagct caagtcccag tggaacaatg ataatcccct tttcaagagc
2341 gccaccacga cggtcatgaa ccccaagttt gctgagagtt aggagcactt ggtgaagaca
2401 aggccgtcag gacccaccat gtctgcccca tcacgcggcc gagacatggc ttggccacag
2461 ctcttgagga tgtcaccaat taaccagaaa tccagttatt ttccgccctc aaaatgacag
2521 ccatggccgg ccggtgcttc tgggggctcg tcggggggac agctccactc tgactggcac
2581 agtctttgca tggagacttg aggagggctt gaggttggtg aggttaggtg cgtgtttcct
2641 gtgcaagtca ggacatcagt ctgattaaag gtggtgccaa tttatttaca tttaaacttg
2701 tcagggtata aaatgacatc ccattaatta tattgttaat caatcacgtg tatagaaaaa
2761 aaaataaaac ttcaat



SEQ ID NO: 154



Amino acid sequence of human ITGB2 encoded by the DNA sequence shown in SEQ ID 
NO: 153. 

MLGLRPPLLALVGLLSLGCVLSQECTKFKVSSCRECIESGPGCTWCQKLNFTGPGDPDSI
RCDTRPQLLMRGCAADDIMDPTSLAETQEDHNGGQKQLSPQKVTLYLRPGQAAAFNVTFR
RAKGYPIDLYYLMDLSYSMLDDLRNVKKLGGDLLRALNEITESGRIGFGSFVDKTVLPFV
NTHPDKLRNPCPNKEKECQPPFAFRHVLKLTNNSNQFQTEVGKQLISGNLDAPEGGLDAM
MQVAACPEEIGWRNVTRLLVFATDDGFHFAGDGKLGAILTPNDGRCHLEDNLYKRSNEFD
YPSVGQLAHKLAENNIQPIFAVTSRMVKTYEKLTEIIPKSAVGELSEDSSNVVHLIKNAY
NKLSSRVFLDHNALPDTLKVTYDSFCSNGVTHRNQPRGDCDGVQINVPITFQVKVTATEC
IQEQSFVIRALGFTDIVTVQVLPQCECRCRDQSRDRSLCHGKGFLECGICRCDTGYIGKN
CECQTQGRSSQELEGSCRKDNNSIICSGLGDCVCGQCLCHTSDVPGKLIYGQYCECDTIN
CERYNGQVCGGPGRGLCFCGKCRCHPGFEGSACQCERTTEGCLNPRRVECSGRGRCRCNV
CECHSGYQLPLCQECPGCPSPCGKYISCAECLKFEKGPFGKNCSAACPGLQLSNNPVKGR
TCKERDSEGCWVAYTLEQQDGMDRYLIYVDESRECVAGPNIAAIVGGTVAGIVLIGILLL
VIWKALIHLSDLREYRRFEKEKLKSQWNNDNPLFKSATTTVMNPKFAES



SEQ ID NO: 155 



gi|6680489|ref|NM_008404.1| Mus musculus integrin beta 2 (Itgb2), mRNA

1 atgctgggcc tgcgcccctc actgctgctt gccctagctg gactgttctt cctgggatct 
61 gctgtgtccc aggaatgcac caagtacaaa gtcagcagtt gccgggactg tatccagtcg 
121 gggcctggct gttcctggtg ccagaagctg aacttcactg gaccaggaga acctgactcc
181 ttgcgctgtg acacacgggc acagctgctg ctgaagggtt gtccagccga tgatatcatg 
241 gaccccagga gcatcgctaa tcctgagttc gaccaacggg ggcaacggaa acagctatct 
301 ccacaaaaag tgacacttta cttgcgacca ggacaggctg ccgcattcaa tgtgactttc 
361 cggcgggcca agggataccc cattgatctg tactacctca tggatctctc ctactccatg 
421 cttgatgacc tcaacaacgt caagaagctg ggcggggact tgctgcaggc cctcaacgag
481 atcaccgagt ctggccgcat cggctttggg tcgtttgtgg acaagacggt gctgcctttt
541 gttaacaccc atcctgagaa gctgaggaac ccatgtccca acaaggagaa ggcctgccag 
601 cccccatttg cctttcggca cgtgctcaag ttaaccgaca actccaacca gtttcagaca 
661 gaggtcggca agcaactgat ttccggaaac ctggacgccc ctgagggtgg gctggatgcc 
721 ataatgcaag ttgctgcatg tccggaggaa attggctggc gcaatgtcac gaggctgctg
781 gtgtttgcca cagacgatgg cttccacttt gctggtgatg gcaaactggg tgccatcctg 
841 acccccaatg atggccgctg ccacctggag gataacatgt acaagaggag caatgagttc 
901 gactacccat ccgtgggtca gctggcacac aaactttccg agagcaacat ccagcccatc 
961 tttgcggtga caaagaagat ggtgaaaacg tatgagaaac tcacggagat catccccaag 
1021 tcagcagtgg gggaactgtc tgacgactcc agcaacgtgg tgcagctcat caagaatgcc
1081 tactataaac tctcctctag agtcttcctg gaccacagca ccctcccgga caccctgaaa 
1141 gtcacctatg actccttctg cagtaatgga gcatcgagta taggcaaatc ccgtggggac
1201 tgtgatggcg tacagatcaa caacccggtc accttccagg taaaggtcat ggcttccgag 
1261 tgtatccagg agcagtcctt tgtcatccgg gcactgggtt tcacggatac agtgaccgtg 
1321 caggtccgtc cccagtgtga gtgtcactgc cgggaccaga gtcgggagca gagtctctgt
1381 ggaggcaagg gagtcatgga gtgtggtatc tgcaggtgtg agtctggcta cattgggaaa 
1441 aactgtgagt gccagactca gggtcggagc agccaggagc tggagagaaa ctgtcggaag 
1501 gacaatagtt ccatcgtgtg ctcagggctt ggggactgca tctgtgggca gtgtgtatgc
1561 cataccagtg acgtccccaa caaagagatc tttgggcaat actgcgagtg tgacaatgtc
1621 aactgtgaga gatataacag ccaagtctgc ggtggctcag atcggggttc ctgcaactgt 
1681 ggcaaatgta gttgcaagcc cggttacgag ggctcggcct gccagtgtca gaggtccacc 
1741 acgggctgtc tgaatgcacg gctggtagag tgcagtggcc gtggccactg ccaatgcaac 
1801 aggtgcatat gtgacgaagg ctaccagcca ccgatgtgtg aggattgtcc cagctgtggc
1861 tcgcactgca gggacaacca cacctcttgt gccgagtgcc tgaagtttga taagggccct 
1921 tttgagaaga actgtagtgt tcagtgtgct ggtatgacgc tgcagactat ccctttgaag 
1981 aaaaagccct gcaaggagaa ggactcggaa ggctgttgga taacttacac tttgcagcag 
2041 aaggacggaa ggaacattta caacatccat gtggaggaca gtctagagtg tgtgaagggc 

2101 cccaatgtgg ctgccatcgt agggggcacc gtggtaggtg tcgtactgat tggtgtcctc
2161 ctcctggtca tctggaaggc cctgacccac ctgactgacc tcagggagta caggcgcttt 
2221 gagaaggaga aactcaagtc ccaatggaac aatgacaacc ccctcttcaa gagtgctacg 
2281 acaacggtca tgaaccccaa gtttgctgaa agctagagca tgagttatca taatcaagca 
2341 gatgtgaccc cctcagacca cgcctcctcc cctctgcaaa cacaacgtgg cttacagctc 

2401 accccagtgc tgccaaggat ccaaaagcct gctcggtttc ttnccgccat tatatcaag



SEQ ID NO: 156

Amino acid sequence of mouse ITGB2 encoded by the DNA sequence shown in SEQ ID NO: 
155. 

MLGLRPSLLLALAGLFFLGSAVSQECTKYKVSSCRDCIQSGPGCSWCQKLNFTGPGEPDS
LRCDTRAQLLLKGCPADDIMDPRSIANPEFDQRGQRKQLSPQKVTLYLRPGQAAAFNVTF
RRAKGYPIDLYYLMDLSYSMLDDLNNVKKLGGDLLQALNEITESGRIGFGSFVDKTVLPF
VNTHPEKLRNPCPNKEKACQPPFAFRHVLKLTDNSNQFQTEVGKQLISGNLDAPEGGLDA
IMQVAACPEEIGWRNVTRLLVFATDDGFHFAGDGKLGAILTPNDGRCHLEDNMYKRSNEF
DYPSVGQLAHKLSESNIQPIFAVTKKMVKTYEKLTEIIPKSAVGELSDDSSNVVQLIKNA
YYKLSSRVFLDHSTLPDTLKVTYDSFCSNGASSIGKSRGDCDGVQINNPVTFQVKVMASE
CIQEQSFVIRALGFTDTVTVQVRPQCECHCRDQSREQSLCGGKGVMECGICRCESGYIGK
NCECQTQGRSSQELERNCRKDNSSIVCSGLGDCICGQCVCHTSDVPNKEIFGQYCECDNV
NCERYNSQVCGGSDRGSCNCGKCSCKPGYEGSACQCQRSTTGCLNARLVECSGRGHCQCN
RCICDEGYQPPMCEDCPSCGSHCRDNHTSCAECLKFDKGPFEKNCSVQCAGMTLQTIPLK
KKPCKEKDSEGCWITYTLQQKDGRNIYNIHVEDSLECVKGPNVAAIVGGTVVGVVLIGVL
LLVIWKALTHLTDLREYRRFEKEKLKSQWNNDNPLFKSATTTVMNPKFAES

SEQ ID NO: 157 

gi|34852368|ref|XM_228072.2| Rattus norvegicus similar to Integrin beta-2 precursor (Cell 
surface adhesion glycoproteins LFA-1/CR3/P150,95 beta-subunit) (CD18) (Complement
receptor C3 beta-subunit) (LOC309684), mRNA 

1 atggatggct ttggttgtca tgtaacagga gccctgtccg aggaatgcac caagtacaaa 
61 gtcagcaact gccgggactg tatccagtcg gggcctggct gctcgtggtg ccagaagctg 
121 aacttcaccg gaccagggga gcctgactcc ttgcgctgtg acacgcgggc acagctgctc 

181 ctcaagggtt gcccagccga tgatataatg gaccccaaga gctttgctga tctccacccc
241 caatatcagg tgcaacggag tcaactgtct ccacaaaaag tgacccttaa cttgcgacca
301 gggcaggctg ctgcattcaa tgtgactttc cgacgggcca agggctaccc cattgatctg 
361 tactacctca tggacctctc ctactctatg ctcgatgacc tgaacaatgt caagaagttg 
421 ggtggtgatt tgctgcaggc cctcaacgag atcacagagt ccggccgcat cggcttcggg 

481 tccttcgtgg acaagacggt gctgcctttt gtcaacaccc atcccgagaa gctgaggaac
541 ccatgcccca acaaggagaa agcctgccag cctccgtttg cctttcgcca cgtgctcaag 
601 ctaaccgaca actccaacca gtttcagaca gaggtcggca agcaactgat ttccggaaac
661 ctggacgccc ctgagggcgg gctggatgcc ataatgcaag tggctgcctg tccgttctcc 
721 tgtgagccag gctgcccagc agtggatctt ctggttccta aggccttttc cctctcagat 

781 caagagcacg acggcttcca ctttgccggt gatgggaaac tgggtgccat cctgaccccc
841 aacgatggcc gctgccacct ggaggataac atgtacaaga ggagcaatga gttcgactac 
901 ccgtcagtgg gccagctggc ccacaaactt tccgagagca acatccagcc catctttgca
961 gtgacaaaga agatggtgaa aacctatgag aaactgacag agattatccc caagtcagcg
1021 gtgggcgagc tgtctgacga ttccagtaac gtagtccagc ttatcaagaa agcctactac 
1081 aaactctcct ctagagtctt cctggaccac accaccatcc cggacaccct gaaagtcacc 
1141 tatgactcct tctgtaataa cagagtatcg agtataggca aatcccgagg ggactgtgac 
1201 ggtgtgcaga tcaacaaccc ggtcaccttc caggtaaagg tcacggcttc ggagtgtatc
1261 caggagcagt cctttgtcat ccgggcgctg ggcttcaccg acacagtgac ggtacaggtc 
1321 catccccagt gcgagtgcca gtgccgggac cagagtcgga tgaggaatct ctgtggaggc 
1381 aagggagtca tggagtgtgg catctgcagg tgtgagtctg gctacattgg gaaaaactgt 
1441 gagtgtcaga cgcagggccg gagcagccag gagctggagg ggaactgccg gaaggacaat 

1501 agttccattg tgtgctcggg gctgggggac tgcatctgcg ggcagtgcgt gtgccacacg
1561 agtgacatcc ccaacaaagt gatctttggg caatactgcg agtgtgacaa cttcaactgt 
1621 gagagatatg atggccaagt ctgcggtggc ctaaagagag gctcctgctc ctgtggccag
1681 tgtaattgca aggagggttt cgagggttct gcttgccagt gtcagaggtc taccacgggc 
1741 tgtctgaacg cacggctggt ggagtgcagt ggccgtggcc ggtgccaatg caacagatgc 

1801 atctgtgaga aaggttacca gccacctctg tgtgaagagt gtcccggctg ccccttgccc
1861 tgcagcacct acgtcttctg tgccgagtgc ctgaaatttg ataagggccc ctttcagaag 
1921 aattgtagtg ttcagtgtgc caatgtgacg ctgcagactg tccctttcaa gaaaaagccc 
1981 tgcaaggaga gggactcgga gggctgctgg ataacctaca ctttgcagca gaaggacgga 
2041 aacgcttaca acatccatgt ggacgacgat cgagagtgtg tgaaaggccc caacgtggct 

2101 gccatcatag ggggcactgt ggccggcgtt gtactgattg gtgtcctcct cctggtcatc
2161 tggaaggctc ttacccacct gactgacctc aatgaataca gacgctttga gaaggagaaa 
2221 ctcaagtccc agtggaacaa cgacaacccc ctcttcaaga gcgccacgac aacggtcatg 
2281 aaccctaagt ttgctgagag ctagagaagg agtcagagga gacccctcca gaccatgcct 
2341 cctcccctct gcaaatagaa tgtagcttac agctagcccc agtgctgcca aggatccaaa 

2401 agcctacttt gtttctttcc gccattatat caaggctgcc agggtttcca cagactcatc
2461 ttccgaccta tacaatcttg ccacagagcc tgcagattgt tccggagtcc caagaggttc 
2521 cacacacgtt ttcgtgcata aagggaagac aggggtctca gtaaaggtgg ccccagctta 
2581 tttatattta aacttgttag cgtataaaac tactattata ttgttaacat cctgtccgtt 
2641 gtattatatg tgagtgtaaa actatatccc acatatatca gaatcatgtg tgtaaaaata 

2701 ataaagcttc cattcagggc tgcagagatg gctcagtggt taagagcact gactgctctt
2761 ccagaggtcc tgagttcaat tcccagcatc cacatggtgg ctcacaacca tctgtaatgg 
2821 gatctgatgc ccttgtctgg tgtggctgaa gatagcaaca gtgtactcac atacataaaa 
2881 taaataaagc cttttaataa aaaaattaat aaagcttcca 

SEQ ID NO: 158

Amino acid sequence of rat ITGB2 encoded by the DNA sequence shown in SEQ ID NO:
157. 

MDGFGCHVTGALSEECTKYKVSNCRDCIQSGPGCSWCQKLNFTGPGEPDSLRCDTRAQLL
LKGCPADDIMDPKSFADLHPQYQVQRSQLSPQKVTLNLRPGQAAAFNVTFRRAKGYPIDL
YYLMDLSYSMLDDLNNVKKLGGDLLQALNEITESGRIGFGSFVDKTVLPFVNTHPEKLRN
PCPNKEKACQPPFAFRHVLKLTDNSNQFQTEVGKQLISGNLDAPEGGLDAIMQVAACPFS
CEPGCPAVDLLVPKAFSLSDQEHDGFHFAGDGKLGAILTPNDGRCHLEDNMYKRSNEFDY
PSVGQLAHKLSESNIQPIFAVTKKMVKTYEKLTEIIPKSAVGELSDDSSNVVQLIKKAYY
KLSSRVFLDHTTIPDTLKVTYDSFCNNRVSSIGKSRGDCDGVQINNPVTFQVKVTASECI
QEQSFVIRALGFTDTVTVQVHPQCECQCRDQSRMRNLCGGKGVMECGICRCESGYIGKNC
ECQTQGRSSQELEGNCRKDNSSIVCSGLGDCICGQCVCHTSDIPNKVIFGQYCECDNFNC
ERYDGQVCGGLKRGSCSCGQCNCKEGFEGSACQCQRSTTGCLNARLVECSGRGRCQCNRC
ICEKGYQPPLCEECPGCPLPCSTYVFCAECLKFDKGPFQKNCSVQCANVTLQTVPFKKKP
CKERDSEGCWITYTLQQKDGNAYNIHVDDDRECVKGPNVAAIIGGTVAGVVLIGVLLLVI
WKALTHLTDLNEYRRFEKEKLKSQWNNDNPLFKSATTTVMNPKFAES

SEQ ID NO: 159

gi|20127646|ref|NM_030569.2| Homo sapiens inter-alpha (globulin) inhibitor H5 (ITIH5),
mRNA

1 gtgtcccgcc gggtccccga gcgtcccgcg ccctcgcccc gccatgctcc tgctgctggg
61 gctgtgcctg gggctgtccc tgtgtgtggg gtcgcaggaa gaggcgcaga gctggggcca 
121 ctcttcggag caggatggac tcagggtccc gaggcaagtc agactgttgc agaggctgaa 
181 aaccaaacct ttgatgacag aattctcagt gaagtctacc atcatttccc gttatgcctt 
241 cactacggtt tcctgcagaa tgctgaacag agcttctgaa gaccaggaca ttgagttcca 
301 gatgcagatt ccagctgcag ctttcatcac caacttcact atgcttattg gagacaaggt 
361 gtatcagggc gaaattacag agagagaaaa gaagagtggt gatagggtaa aagagaaaag 
421 gaataaaacc acagaagaaa atggagagaa ggggactgaa atattcagag cttctgcagt 
481 gattcccagc aaggacaaag ccgccttttt cctgagttat gaggagcttc tgcagaggcg 
541 cctgggcaag tacgagcaca gcatcagcgt gcggccccag cagctgtccg ggaggctgag 
601 cgtgggcgtg aatatcctgg agagcgcggg catcgcatcc ctggaggtgc tgccgcttca 
661 caacagcagg cagaggggca gtgggcgcgg ggaagatgat tctgggcctc ccccatctac 
721 tgtcattaac caaaatgaaa catttgccaa cataattttt aaacctactg tagtacaaca 
781 agccaggatt gcccagaatg gaattttggg agactttatc attagatatg acgtcaatag 
841 agaacagagc attggggaca tccaggttct aaatggctat tttgtgcact actttgctcc 
901 taaagacctt cctcctttac ccaagaatgt ggtattcgtg cttgacagca gtgcttctat 
961 ggtgggaacc aaactccggc agaccaagga tgccctcttc acaattctcc atgacctccg 
1021 accccaggac cgtttcagta tcattggatt tcccaaccgg atcaaagtat ggaaggacca 
1081 cttgatatca gtcactccag acagcatcag ggatgggaaa gtgtacattc accatatgtc
1141 acccactgga ggcacagaca tcaacggggc cctgcagagg gccatcaggc tcctcaacaa 
1201 gtacgtggcc cacagtggca ttggagaccg gagcgtgtcc ctcatcgtct tcctgacgga 
1261 tgggaagccc acggtcgggg agacgcacac cctcaagatc ctcaacaaca cccgagaggc 
1321 cgcccgaggc caagtctgca tcttcaccat tggcatcggc aacgacgtgg acttcaggct 
1381 gctggagaaa ctgtcgctgg agaactgtgg cctcacacgg cgcgtgcacg aggaggagga 
1441 cgcaggctcg cagctcatcg ggttctacga tgaaatcagg accccgctcc tctctgacat 
1501 ccgcatcgat tatcccccca gctcagtggt gcaggccacc aagaccctgt tccccaacta 
1561 cttcaacggc tcggagatca tcattgcggg gaagctggtg gacaggaagc tggatcacct 
1621 gcacgtggag gtcaccgcca gcaacagtaa gaaattcatc atcctgaaga cagatgtgcc 
1681 tgtgcggcct cagaaggcag ggaaagatgt cacaggaagc cccaggcctg gaggcgatgg 
1741 agagggggac cccaaccaca tcgagcgtct ctggagctac ctcaccacaa aggagctgct 
1801 gagctcctgg ctgcaaagtg acgatgaacc ggagaaggag cggctgcggc agcgggccca 
1861 ggccctggct gtgagctacc gcttcctcac tcccttcacc tccatgaagc tgagggggcc 
1921 ggtcccacgc atggacggcc tggaggaggc ccacggcatg tcggctgcca tgggacccga 
1981 accggtggtg cagagcgtgc gaggagctgg cacgcagcca ggacctttgc tcaagaagcc 
2041 ataccagcca agaattaaaa tctctaaaac atcagtggat ggtgatcccc actttgttgt 
2101 ggatttcccc ctgagcagac tcaccgtgtg cttcaacatt gatgggcagc ccggggacat 
2161 cctcaggctg gtctctgatc acagggactc tggtgtcaca gtgaacggag agttaattgg 
2221 ggcacccgcc cctccaaatg gccacaagaa acagcgcact tacttgcgca ctatcaccat 
2281 cctcatcaac aagccagaga gatcttatct cgagatcaca ccgagcagag tcatcttgga 
2341 tggtggggac agactggtgc tcccctgcaa ccagagtgtg gtggtgggga gctgggggct 
2401 ggaggtgtcc gtgtctgcca acgccaatgt caccgtcacc atccagggct ccatagcctt 
2461 tgtcatcctc atccacctct acaaaaagcc ggcgcccttc cagcgacacc acctgggttt 
2521 ctacattgcc aacagcgagg gcctttccag caactgccac ggactgctgg gtcagttcct 
2581 gaatcaggat gccagactca cagaagaccc tgcagggccc agccagaacc tcactcaccc 
2641 tctgctcctt caggtgggag aggggcctga ggccgtccta acagtgaaag gccaccaagt 
2701 cccagtggtc tggaagcaaa ggaagattta caacggggaa gagcagatag actgctggtt 
2761 tgccaggaac aatgccgcca aactgattga cggggagtac aaggattacc tggcatccca 
2821 tccatttgac acagggatga cacttggcca gggaatgtcc agggagctct gaagctggca 
2881 gccttaaaga tgcaagtgca tgaaggacag tgatgtgggg aggccgtggg gcagctcttt 
2941 tcatggcttg tacacgcctc agctcctggc aattagctgg actccatgac ccacccctgg 
3001 tgcagcatag atccgacgtc tgtctgggcg aagggtaggg gtgggtaggg gcgggaagcc 
3061 tgagtgcaaa tgtcatttcc ctctactgcc tcttcctgcc tctccccacc ctgcccacat 
3121 ccacagaggg gagagaaggg tcatagctaa atgcaacaaa gtctgtatct tgtcccaacc 
3181 tgcttttctg ttctgttagc atatcataaa gtaagccttt ctggtg 

SEQ ID NO: 160

Amino acid sequence of human ITIH5 encoded by the DNA sequence shown in SEQ ID NO: 
159. 



WO 2006/007400

PCT/US2005/021297

MLLLLGLCLGLSLCVGSQEEAQSWGHSSEQDGLRVPRQVRLLQRLKTKPLMTEFSVKSTI
ISRYAFTTVSCRMLNRASEDQDIEFQMQIPAAAFITNFTMLIGDKVYQGEITEREKKSGD
RVKEKRNKTTEENGEKGTEIFRASAVIPSKDKAAFFLSYEELLQRRLGKYEHSISVRPQQ
LSGRLSVGVNILESAGIASLEVLPLHNSRQRGSGRGEDDSGPPPSTVINQNETFANIIFK
PTVVQQARIAQNGILGDFIIRYDVNREQSIGDIQVLNGYFVHYFAPKDLPPLPKNVVFVL
DSSASMVGTKLRQTKDALFTILHDLRPQDRFSIIGFPNRIKVWKDHLISVTPDSIRDGKV
YIHHMSPTGGTDINGALQRAIRLLNKYVAHSGIGDRSVSLIVFLTDGKPTVGETHTLKIL
NNTREAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEIRT
PLLSDIRIDYPPSSVVQATKTLFPNYFNGSEIIIAGKLVDRKLDHLHVEVTASNSKKFII
LKTDVPVRPQKAGKDVTGSPRPGGDGEGDPNHIERLWSYLTTKELLSSWLQSDDEPEKER
LRQRAQALAVSYRFLTPFTSMKLRGPVPRMDGLEEAHGMSAAMGPEPVVQSVRGAGTQPG
PLLKKPYQPRIKISKTSVDGDPHFVVDFPLSRLTVCFNIDGQPGDILRLVSDHRDSGVTV
NGELIGAPAPPNGHKKQRTYLRTITILINKPERSYLEITPSRVILDGGDRLVLPCNQSVV
VGSWGLEVSVSANANVTVTIQGSIAFVILIHLYKKPAPFQRHHLGFYIANSEGLSSNCHG
LLGQFLNQDARLTEDPAGPSQNLTHPLLLQVGEGPEAVLTVKGHQVPVVWKQRKIYNGEE
QIDCWFARNNAAKLIDGEYKDYLASHPFDTGMTLGQGMSREL

SEQ ID NO: 161

Amino acid sequence of human ITIH5, a soluble active secreted form derived from SEQ ID
NO: 160. 

QSWGHSSEQDGLRVPRQVRLLQRLKTKPLMTEFSVKSTIISRYAFTTVSCRMLNRASEDQ
DIEFQMQIPAAAFITNFTMLIGDKVYQGEITEREKKSGDRVKEKRNKTTEENGEKGTEIF
RASAVIPSKDKAAFFLSYEELLQRRLGKYEHSISVRPQQLSGRLSVGVNILESAGIASLE
VLPLHNSRQRGSGRGEDDSGPPPSTVINQNETFANIIFKPTVVQQARIAQNGILGDFIIR
YDVNREQSIGDIQVLNGYFVHYFAPKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTI
LHDLRPQDRFSIIGFPNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAI
RLLNKYVAHSGIGDRSVSLIVFLTDGKPTVGETHTLKILNNTREAARGQVCIFTIGIGND
VDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEIRTPLLSDIRIDYPPSSVVQATKT
LFPNYFNGSEIIIAGKLVDRKLDHLHVEVTASNSKKFIILKTDVPVRPQKAGKDVTGSPR
PGGDGEGDPNHIERLWSYLTTKELLSSWLQSDDEPEKERLRQRAQALAVSYRFLTPFTSM
KLRGPVPRMDGLEEAHGMSAAMGPEPVVQSVRGAGTQPGPLLKKPYQPRIKISKTSVDGD
PHFVVDFPLSRLTVCFNIDGQPGDILRLVSDHRDSGVTVNGELIGAPAPPNGHKKQRTYL
RTITILINKPERSYLEITPSRVILDGGDRLVLPCNQSVVVGSWGLEVSVSANANVTVTIQ
GSIAFVILIHLYKKPAPFQRHHLGFYIANSEGLSSNCHGLLGQFLNQDARLTEDPAGPSQ
NLTHPLLLQVGEGPEAVLTVKGHQVPVVWKQRKIYNGEEQIDCWFARNNAAKLIDGEYKD
YLASHPFDTGMTLGQGMSREL

SEQ ID NO: 162

gi|27369643|ref|NM_172471.1| Mus musculus inter-alpha (globulin) inhibitor H5 (Itih5),
mRNA



1 gccctgaaaa accttcgctg tcccagagag gtcgcgcgct ctggcttcgc catgctcctg 

61 ctgctagggc tgtgcttggg gttgcccctc ttctcagagt cccaggaaga ggcaaggagt

121 tgggacgaca cctcggagca agtcgtgctc agggtcccca ggcagctcag gctgttgcaa 

181 agactgaaaa ccaagccctt gatggcagag ttttcggtga agtctaccat catttcccgc 

241 tatgccttca ccacggtgtc ctgcaggatg ctcaacagag cttctgaaga ccaggaggct 

301 gagttccaga tgcagattcc agaatcagct ttcatcacca acttcaccat gcttatagga 

361 gacagcgtgt atcggggtga aattacacag aaagataaga aaagcagtga gagcgtaaaa

421 gataaaagga acagaacctc agacgataat gaagaaaacg ggagtgacat gttcaaagcg 

481 tctttagtga ttcccagcaa ggacaaagct gctttcttcc tcagttatga agagcttctg 

541 cagaggaggc tggggaaata cgagcatagc atcagcgtgc gcccccagca gctggtgggg 

601 aggctgactg tggaggtgga catcctggag cgatcgggca tcacatcctt ggaagtgctg 

661 cctctccaca acagcaggaa gaagggtagt gggaaggcag aaggtgatgt gggtcctccc

721 ccttctactc tcatcaacca aaatgagaca tttgccaaag tcatctttaa gcctactgtg 

781 gtccaacaag ctaagattgc ccagaatggg atcttgggag atttcatcgt tcggtatgat 




841 gtcgagagag agcagaacat tggtgacatc caggttctga atggttattt cgtgcactac 
901 tttgctccta aaaacctccc tcctctaccc aagaacgtgg tcttcgtgct tgacatcagc 
961 gcttccatgg tgggagcgaa actccagcag accagggagg ctctcgtcac aattctcaat 
1021 gacctccgac cccaggatcg cttcaatatc attgggttct ccaatcggat taaaatgtgg 
1081 aaggaccact tactaccagt gactcctgac aacattagga atggcaagat ctacatgtac
1141 catctgtcac ccactggagg cacagacatc aatggggccc ttcaggctgc catcaaactg 
1201 ctcaacaact acgtggccca gaatgacatt gaagaccgaa gtgtgtccct catcatcttc 
1261 ctcaccgatg ggaaacccac ctttggggag accaataccc tcaagattct cagcaacacc 
1321 aaggaagcca ccagaggtca gatctgcatc ttcaccgttg gcattggcga tgacgtggac 

1381 ttcaaactgt tagagaaact ctcgctggag aactgcggtc tcacgaggcg tgttcatgag
1441 gaggacaagg cgggcgcaca gctcatcggg ttctatgatg agatccggac cccgcttctt
1501 tctgacatcc gcatagatta tcctcctgac gtagtagagc atgccaccaa gaccctgttc 
1561 cctaactact tcaatggctc tgagatagtg attgcaggga agatggtgga caagaagttt 
1621 gatcaactgc atgtggaggt cactgccagc aacagtaaga aatttgtcat cctaaaaaga 

1681 gacatccccg tggagttccg gaagatgggg aatgatgtct cagtcacacc tggctcagcc
1741 agagatggcg ggaaggacct aaaccacatt gagcgacttt ggagctacct cactgtgaag 
1801 gagctgctaa gctcctggag gcagagcaac agtgagcaag agaaagagca gctgaggcag 
1861 aaggcccagg acttagcctt gaactatcat ttcctcaccc ccttcacctc catgaagctg 
1921 aggaagccag ggctccgcac aaaccagctg gaggacacct atggcatgtc tgcagctaca 

1981 ggacctgcga ccgtggtgca gaaccttcga gaggccggca agcagccaga acctgatctc
2041 aaaaaaacat atgacccaag aattaaaatc tctaaaacat cagtggatgg tgatcctcat 
2101 tttgtagtgg atttccccct aagcaaactc actgtctgct ttaacatcga cggagagccg 
2161 ggggacatcc tacgtctggt ctctgatcat ctgaactctg gtgtgactgt gaatggcgag 
2221 cttattgggg cccccgcacc tccgaatggt cacaagaaac agcgcaccta cttccgcacc 

2281 atcaccatcc tcatcaaccg gccagagaga tcttacctgg agatcacacc aagcagggtc
2341 atcctggacg gtggggacag gctggtactc ccctgcaacc agagcgtggt agtagggagt 
2401 cgaggattgg aggtgtcagt gtctgccaat gccaacatca ccgtggtcat ccagggcaac 
2461 attgcctttg tcatcctcat ccacctgtac aaaaacccag cacccttcca gagagaccac 
2521 ctgggcttct acatcgccaa cagcagaggt ctctccgaca actgccacgg actgctaggt 

2581 cagttcctga accaggatgc caaacttgta ggagctcctg aggaatacgg caagaatctt
2641 agtaaccagc catttcctcg ggcagaaggg atgcctgagg ctatcctgaa ggtgaaaggg 
2701 cgccgagttc cagtggtctg gaaacaaagg aagatttaca acgggcaagc acaggtagac 
2761 tgctggtttg acagaaacaa tgctgccaag ctgattgacg gtgtctataa ggactacctg 
2821 gcatctcatc cgtttgacac agagagtgca ctgggcctga gcacgcccag gaaacctgag 

2881 accgacaggc cccatgagga gagtgtctaa aagggaaggg atattgggtc tactctggac
2941 gcaactcttt ttatgtgtgg ctaccttggc tcttgctatg cgctgtactg tgtagcccgc 
3001 cccctggtga gctggaggtc caatgtctct tagagcaatg agtcagggca ggaagctgaa 
3061 atgcaaatac ttcccttcct gccttcctca tctatactag ataaaaaaaa aaacaaacaa 
3121 ac 

SEQ ID NO: 163

Amino acid sequence of mouse ITIH5 encoded by the DNA sequence shown in SEQ ID NO: 
162. 

MLLLLGLCLGLPLFSESQEEARSWDDTSEQVVLRVPRQLRLLQRLKTKPLMAEFSVKSTI
ISRYAFTTVSCRMLNRASEDQEAEFQMQIPESAFITNFTMLIGDSVYRGEITQKDKKSSE
SVKDKRNRTSDDNEENGSDMFKASLVIPSKDKAAFFLSYEELLQRRLGKYEHSISVRPQQ
LVGRLTVEVDILERSGITSLEVLPLHNSRKKGSGKAEGDVGPPPSTLINQNETFAKVIFK
PTVVQQAKIAQNGILGDFIVRYDVEREQNIGDIQVLNGYFVHYFAPKNLPPLPKNVVFVL
DISASMVGAKLQQTREALVTILNDLRPQDRFNIIGFSNRIKMWKDHLLPVTPDNIRNGKI
YMYHLSPTGGTDINGALQAAIKLLNNYVAQNDIEDRSVSLIIFLTDGKPTFGETNTLKIL
SNTKEATRGQICIFTVGIGDDVDFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEIRT
PLLSDIRIDYPPDVVEHATKTLFPNYFNGSEIVIAGKMVDKKFDQLHVEVTASNSKKFVI
LKRDIPVEFRKMGNDVSVTPGSARDGGKDLNHIERLWSYLTVKELLSSWRQSNSEQEKEQ
LRQKAQDLALNYHFLTPFTSMKLRKPGLRTNQLEDTYGMSAATGPATVVQNLREAGKQPE
PDLKKTYDPRIKISKTSVDGDPHFVVDFPLSKLTVCFNIDGEPGDILRLVSDHLNSGVTV
NGELIGAPAPPNGHKKQRTYFRTITILINRPERSYLEITPSRVILDGGDRLVLPCNQSVV
VGSRGLEVSVSANANITVVIQGNIAFVILIHLYKNPAPFQRDHLGFYIANSRGLSDNCHG
LLGQFLNQDAKLVGAPEEYGKNLSNQPFPRAEGMPEAILKVKGRRVPVVWKQRKIYNGQA
QVDCWFDRNNAAKLIDGVYKDYLASHPFDTESALGLSTPRKPETDRPHEESV

SEQ ID NO: 164

gi|20127658|ref|NM_033101.2| Homo sapiens lectin, galactoside-binding, soluble, 12
(galectin 12) (LGALS12), mRNA 

1 acaaaccctc gttggcccca cagggagcca gcctctggct tctctctgca atggccatgt

61 gctgcagacc cggagtgggt agttagttgg ttaatgccag tcttcctccc ctggacactg 
121 agttctgctg acagcccccg cccagccaga gctctgctgt ataccaccgg gagtggggct 
181 ggtgtggagc ctggaggtcg cccgctgccc tcctagggct gctccagaca gcattaaaac 
241 gctgcaggtc gcaggtgaga ctaacagctg ggagagctgc tccaggcatt taggaccctg 
301 actggggcag atgagtcagc ccagtggggg cagggctcct ggaacgagga tctacagttg
361 gagttgcccc actgtcatgt cacctggaga aaaactggac ccaattcctg acagcttcat 
421 tctgcaacca ccagtcttcc acccggtggt tccttatgtc acgacgattt ttggaggcct 
481 gcatgcaggc aagatggtca tgctgcaagg agtggtccct ctagatgcac acaggtttca
541 ggtggacttc cagtgtggct gcagcctgtg tccccggcca gatatcgcct tccacttcaa 
15 601 ccctcgcttc cataccacca agccccatgt catctgcaac accctgcatg gtggacgctg 
661 gcaaagggag gcccggtggc cccacctggc cctgcgaaga ggctccagct tcctcatcct 
721 ctttctcttc gggaatgagg aagtgaaggt gagtgtgaat ggacagcact ttctccactt 
781 ccgctaccgg ctcccactgt ctcatgtgga cacgctgggt atatttggtg acatcctggt 
841 agaggctgtt ggattcctga acatcaatcc atttgtggag ggcagcagag agtacccagc 
20 901 tggacatcct ttcctgctga tgagccccag gctggaggtg ccctgctcac atgctcttcc 
961 ccagggtctc tcgcctgggc aggtcatcat agtacgggga ctggtcttgc aagagccgaa 
1021 gcattttact gtgagcctga gggaccaggc tgcccatgct cctgtgacac tcagggcctc 
1081 cttcgcagac agaactctgg cctggatctc ccgctggggg cagaagaaac tgatctcagc 
1141 ccccttcctc ttttaccccc agagattctt tgaggtgctg ctcctgttcc aggagggagg 
25 1201 gctgaagctg gcgctcaatg ggcaggggct gggggccacc agcatgaacc agcaggccct 
1261 ggagcagctg cgggagctcc ggatcagtgg aagtgtccag ctctactgtg tccactcctg 
1321 aggatggttc caggggaaat accgccagaa aacaagaagg tcagcccact cccagggccc 
1381 cactctcctc ccctcattaa accatccacc tgacaccagc acatcaggcc tggttcacct 
1441 ctggggtcac gagactgagt ctacaggagc tttgggcctg agggaaggca caagagtgca 
30 1501 aaggttcctc gaactctgca ccttcctcca ccaggagcct gggatatggc tccatctgcc 
1561 ttcagggcct ggactgcact cacagaggca agtgttgtag actaacaaag atactccaaa 
1621 atacaatggc ttaaagaatg tggtcattta ttctttatta tttatttatt tgtggtcaaa 
1681 taaataaata aggtta 

SEQ ID NO: 165

Amino acid sequence of human LGALS12 encoded by the DNA sequence shown in SEQ ID
NO: 164.

MSQPSGGRAPGTRIYSWSCPTVMSPGEKLDPIPDSFILQPPVFHPVVPYVTTIFGGLHAG
KMVMLQGVVPLDAHRFQVDFQCGCSLCPRPDIAFHFNPRFHTTKPHVICNTLHGGRWQRE
ARWPHLALRRGSSFLILFLFGNEEVKVSVNGQHFLHFRYRLPLSHVDTLGIFGDILVEAV
GFLNINPFVEGSREYPAGHPFLLMSPRLEVPCSHALPQGLSPGQVIIVRGLVLQEPKHFT
VSLRDQAAHAPVTLRASFADRTLAWISRWGQKKLISAPFLFYPQRFFEVLLLFQEGGLKL
ALNGQGLGATSMNQQALEQLRELRISGSVQLYCVHS

SEQ ID NO: 166

gi|15010855|gb|AF244977.1|AF244977 Homo sapiens galectin-12 isoform d mRNA,
complete cds

1 cctcagcttc ccagagtgct gagattacag gcatgagcca cagagcccag cccaggtaac 
61 ttctacatgg ggaaaggtga gcctgggaag gccattgtta agtgcctctc caggggtcca 
121 tttcagagtt agctgtcagt tcctaacttg tccattcttc ttgcctccca gttcctagtc
181 tccagccact caacagtcct cagaaagccc cagtaccagg cttctgggga ccagggtgac 
241 atctagtgga tggcaagcct ccctggggtc actgcagcct gttgtgtcct aggagctggg 
301 gaaaggcaac tggctggtgc tgaatttggc aagggagcgg aatttgactt ctctggtgta 
361 atgcagcttt gccgtgtgac ggtccaggaa aggggattag gtggaagaaa atatttcagt 
5 421 gaacactgat ttttacctat aaggaatttt ctgtttggag aggaaaagtt gagttttatt 
481 ctgctccttt cctcccactc gcctgacata gaggtcccta agccctttct ccaaacctgc 
541 atggatgagt ttcttttctt gttcaggtgg ttccttatgt cacgacgatt tttggaggcc 
601 tgcatgcagg caagatggtc atgctgcaag gagtggtccc tctagatgca cacaggtttc 
661 aggtggactt ccagtgtggc tgcagcctgt gtccccggcc agatatcgcc ttccacttca 
10 721 accctcgctt ccataccacc aagccccatg tcatctgcaa caccctgcat ggtggacgct 
781 ggcaaaggga ggcccggtgg ccccacctgg ccctgcgaag aggctccagc ttcctcatcc 
841 tctttctctt cgggaatgag gaagtgaagg tgagtgtgaa tggacagcac tttctccact 
901 tccgctaccg gctcccactg tctcatgtgg acacgctggg tatatttggt gacatcctgg 
961 tagaggctgt tggattcctg aacatcaatc catttgtgga gggcagcaga gagtacccag 
15 1021 ctggacatga ggtgccctgc tcacatgctc ttccccaggg tctctcgcct gggcaggtca 
1081 tcatagtacg gggactggtc ttgcaagagc cgaagcattt tactgtgagc ctgagggacc 
1141 aggctgccca tgctcctgtg acactcaggg cctccttcgc agacagaact ctggcctgga 
1201 tctcccgctg ggggcagaag aaactgatct cagccccctt cctcttttac ccccagagat 
1261 tctttgaggt gctgctcctg ttccaggagg gagggctgaa gctggcgctc aatgggcagg 
20 1321 ggctgggggc caccagcatg aaccagcagg ccctggagca gctgcgggag ctccggatca 
1381 gtggaagtgt ccagctctac tgtgtccact cctgaggatg gttccagggg aaataccgcc 
1441 agaaaacaag aaggtcagcc cactcccagg gccccactct cctcccctca ttaaaccatc 
1501 cacctgacac cagcacatca ggcctggttc acctctgggg tcacgagact gagtctacag 
1561 gagctttggg cctgagggaa ggcacaagag tgcaaaggtt cctcgaactc tgcaccttcc 
25 1621 tccaccagga gcctgggata tggctccatc tgccttcagg gcctggactg cactcacaga 
1681 ggcaagtgtt gtagactaac aaagatactc caaaatacaa tggcttaaag aatgtggtca 
1741 tttattcttt attatttatt tatttgtggt caaataaata aataaggtta 

SEQ ID NO: 167

Amino acid sequence of human LGALS12 variant ORF number 1 encoded by the DNA
sequence shown in SEQ ID NO: 166.

MVMLQGVVPLDAHRFQVDFQCGCSLCPRPDIAFHFNPRFHTTKPHVICNTLHGGRWQREA
RWPHLALRRGSSFLILFLFGNEEVKVSVNGQHFLHFRYRLPLSHVDTLGIFGDILVEAVG
FLNINPFVEGSREYPAGHEVPCSHALPQGLSPGQVIIVRGLVLQEPKHFTVSLRDQAAHA
PVTLRASFADRTLAWISRWGQKKLISAPFLFYPQRFFEVLLLFQEGGLKLALNGQGLGAT
SMNQQALEQLRELRISGSVQLYCVHS

SEQ ID NO: 168

gi|15010853|gb|AF244976.1|AF244976 Homo sapiens galectin-12 isoform c mRNA,
complete cds 

1 acaaaccctc gttggcccca cagggagcca gcctctggct tctctctgca atggccatgt 
40 61 gctgcagacc cggagtgggt agttagttgg ttaatgccag tcttcctccc ctggacactg 
121 agttctgctg acagcccccg cccagccaga gctctgctgt ataccaccgg gagtggggct 
181 ggtgtggagc ctggaggtcg cccgctgccc tcctagggct gctccagaca gcattaaaac 
241 gctgcaggtc gcaggtgaga ctaacagctg ggagagctgc tccaggcatt taggaccctg 
301 actggggcag atgagtcagc ccagtggggg cagggctcct ggaacgagga tctacagttg 
45 361 gagttgcccc actgtcatgt cacctggaga aaaactggac ccaattcctg acagcttcat 
421 tctgcaacca ccagtcttcc acccggtggt tccttatgtc acgacgattt ttggaggcct 
481 gcatgcaggc aagatggtca tgctgcaagg agtggtccct ctagatgcac acaggtttca 
541 ggtggacttc cagtgtggct gcagcctgtg tccccggcca gatatcgcct tccacttcaa 
601 ccctcgcttc cataccacca agccccatgt catctgcaac accctgcatg gtggacgctg 
661 gcaaagggag gcccggtggc cccacctggc cctgcgaaga ggctccagct tcctcatcct
721 ctttctcttc gggaatgagg aagtgaaggt gagtgtgaat ggacagcact ttctccactt 
781 ccgctaccgg ctcccactgt ctcatgtgga cacgctgggt atatttggtg acatcctggt 
841 agaggctgtt ggattcctga acatcaatcc atttgtggag ggcagcagag agtacccagc
901 tggacatgag gtgccctgct cacatgctct tccccagggt ctctcgcctg ggcaggtcat
961 catagtacgg ggactggtct tgcaagagcc gaagcatttt actgtgagcc tgagggacca
1021 ggctgcccat gctcctgtga cactcagggc ctccttcgca gacagaactc tggcctggat
1081 ctcccgctgg gggcagaaga aactgatctc agcccccttc ctcttttacc cccagagatt
1141 ctttgaggtg ctgctcctgt tccaggaggg agggctgaag ctggcgctca atgggcaggg
1201 gctgggggcc accagcatga accagcaggc cctggagcag ctgcgggagc tccggatcag
1261 tggaagtgtc cagctctact gtgtccactc ctgaggatgg ttccagggga aataccgcca
1321 gaaaacaaga aggtcagccc actcccaggg ccccactctc ctcccctcat taaaccatcc
1381 acctgacacc agcacatcag gcctggttca cctctggggt cacgagactg agtctacagg
1441 agctttgggc ctgagggaag gcacaagagt gcaaaggttc ctcgaactct gcaccttcct
1501 ccaccaggag cctgggatat ggctccatct gccttcaggg cctggactgc actcacagag
1561 gcaagtgttg tagactaaca aagatactcc aaaatacaat ggcttaaaga atgtggtcat
1621 ttattcttta ttatttattt atttgtggtc aaataaataa ataaggtta

SEQ ID NO: 169

Amino acid sequence of human LGALS12 variant ORF number 2 encoded by the DNA 
sequence shown in SEQ ID NO: 168. 

MSQPSGGRAPGTRIYSWSCPTVMSPGEKLDPIPDSFILQPPVFHPVVPYVTTIFGGLHAG
KMVMLQGVVPLDAHRFQVDFQCGCSLCPRPDIAFHFNPRFHTTKPHVICNTLHGGRWQRE
ARWPHLALRRGSSFLILFLFGNEEVKVSVNGQHFLHFRYRLPLSHVDTLGIFGDILVEAV
GFLNINPFVEGSREYPAGHEVPCSHALPQGLSPGQVIIVRGLVLQEPKHFTVSLRDQAAH
APVTLRASFADRTLAWISRWGQKKLISAPFLFYPQRFFEVLLLFQEGGLKLALNGQGLGA
TSMNQQALEQLRELRISGSVQLYCVHS

SEQ ID NO: 170 

gi|15010851|gb|AF244975.1|AF244975 Homo sapiens galectin-12 isoform b mRNA,
complete cds 

1 cctcagcttc ccagagtgct gagattacag gcatgagcca cagagcccag cccaggtaac 
61 ttctacatgg ggaaaggtga gcctgggaag gccattgtta agtgcctctc caggggtcca 
121 tttcagagtt agctgtcagt tcctaacttg tccattcttc ttgcctccca gttcctagtc 
30 181 tccagccact caacagtcct cagaaagccc cagtaccagg cttctgggga ccagggtgac 
241 atctagtgga tggcaagcct ccctggggtc actgcagcct gttgtgtcct aggagctggg 
301 gaaaggcaac tggctggtgc tgaatttggc aagggagcgg aatttgactt ctctggtgta 
361 atgcagcttt gccgtgtgac ggtccaggaa aggggattag gtggaagaaa atatttcagt 
421 gaacactgat ttttacctat aaggaatttt ctgtttggag aggaaaagtt gagttttatt 
35 481 ctgctccttt cctcccactc gcctgacata gaggtcccta agccctttct ccaaacctgc 
541 atggatgagt ttcttttctt gttcaggtgg ttccttatgt cacgacgatt tttggaggcc 
601 tgcatgcagg caagatggtc atgctgcaag gagtggtccc tctagatgca cacaggtttc 
661 aggtggactt ccagtgtggc tgcagcctgt gtccccggcc agatatcgcc ttccacttca 
721 accctcgctt ccataccacc aagccccatg tcatctgcaa caccctgcat ggtggacgct 
40 781 ggcaaaggga ggcccggtgg ccccacctgg ccctgcgaag aggctccagc ttcctcatcc 
841 tctttctctt cgggaatgag gaagtgaagg tgagtgtgaa tggacagcac tttctccact 
901 tccgctaccg gctcccactg tctcatgtgg acacgctggg tatatttggt gacatcctgg 
961 tagaggctgt tggattcctg aacatcaatc catttgtgga gggcagcaga gagtacccag 
1021 ctggacatcc tttcctgctg atgagcccca ggctggaggt gccctgctca catgctcttc 
45 1081 cccagggtct ctcgcctggg caggtcatca tagtacgggg actggtcttg caagagccga 
1141 agcattttac tgtgagcctg agggaccagg ctgcccatgc tcctgtgaca ctcagggcct 
1201 ccttcgcaga cagaactctg gcctggatct cccgctgggg gcagaagaaa ctgatctcag
1261 cccccttcct cttttacccc cagagattct ttgaggtgct gctcctgttc caggagggag 
1321 ggctgaagct ggcgctcaat gggcaggggc tgggggccac cagcatgaac cagcaggccc 
50 1381 tggagcagct gcgggagctc cggatcagtg gaagtgtcca gctctactgt gtccactcct 
1441 gaggatggtt ccaggggaaa taccgccaga aaacaagaag gtcagcccac tcccagggcc 
1501 ccactctcct cccctcatta aaccatccac ctgacaccag cacatcaggc ctggttcacc
1561 tctggggtca cgagactgag tctacaggag ctttgggcct gagggaaggc acaagagtgc
1621 aaaggttcct cgaactctgc accttcctcc accaggagcc tgggatatgg ctccatctgc
1681 cttcagggcc tggactgcac tcacagaggc aagtgttgta gactaacaaa gatactccaa
1741 aatacaatgg cttaaagaat gtggtcattt attctttatt atttatttat ttgtggtcaa
1801 ataaataaat aaggtta

SEQ ID NO: 171

Amino acid sequence of human LGALS12 variant ORF number 3 encoded by the DNA 
sequence shown in SEQ ID NO: 170. 

MVMLQGVVPLDAHRFQVDFQCGCSLCPRPDIAFHFNPRFHTTKPHVICNTLHGGRWQREA
RWPHLALRRGSSFLILFLFGNEEVKVSVNGQHFLHFRYRLPLSHVDTLGIFGDILVEAVG
FLNINPFVEGSREYPAGHPFLLMSPRLEVPCSHALPQGLSPGQVIIVRGLVLQEPKHFTV
SLRDQAAHAPVTLRASFADRTLAWISRWGQKKLISAPFLFYPQRFFEVLLLFQEGGLKLA
LNGQGLGATSMNQQALEQLRELRISGSVQLYCVHS

SEQ ID NO: 172 

gi|11878242|gb|AF310686.1|AF310686 Homo sapiens galectin-12 splice form 1 mRNA,
complete cds, alternatively spliced 

1 agttggagtt gccccactgt catgtcacct ggagaaaaac tggacccaat tcctgacagc 
61 ttcattctgc aaccaccagt cttccacccg gtggttcctt atgtcacgac gatttttgga 
121 ggcctgcatg caggcaagat ggtcatgctg caaggagtgg tccctctaga tgcacacagg 
20 181 tttcaggtgg acttccagtg tggctgcagc ctgtgtcccc ggccagatat cgccttccac 
241 ttcaaccctc gcttccatac caccaagccc catgtcatct gcaacaccct gcatggtgga 
301 cgctggcaaa gggaggcccg gtggccccac ctggccctgc gaagaggctc cagcttcctc 
361 atcctctttc tcttcgggaa tgaggaagtg aaggtgagtg tgaatggaca gcactttctc 
421 cacttccgct accggctccc actgtctcat gtggacacgc tgggtatatt tggtgacatc 
25 481 ctggtagagg ctgttggatt cctgaacatc aatccatttg tggagggcag cagagagtac 
541 ccagctggac atcctttcct gctgatgagc cccaggctgg aggtgccctg ctcacatgct 
601 cttccccagg gtctctcgcc tgggcaggtc atcatagtac ggggactggt cttgcaagag 
661 ccgaagcatt ttactgtgag cctgagggac caggctgccc atgctcctgt gacactcagg 
721 gcctccttcg cagacagaac tctggcctgg atctcccgct gggggcagaa gaaactgatc 
30 781 tcagccccct tcctctttta cccccagaga ttctttgagg tgctgctcct gttccaggag 
841 ggagggctga agctggcgct caatgggcag gggctggggg ccaccagcat gaaccagcag 
901 gccctggagc agctgcggga gctccggatc agtggaagtg tccagctcta ctgtgtccac 
961 tcctgaggat ggttccaggg aaataccgcc agaaaacaag aaggtcagcc cactcccagg 
1021 gccccactct cctcccctca ttaaaccatc cacctgacac cagcacatca ggcctggttc 
35 1081 acctctgggg tcacgagact gagtctacag gagctttggg cctgagggaa ggcacaagag 
1141 tgcaaaggtt cctcgaactc tgcaccttcc tccaccagga gcctgggata tggctccatc 
1201 tgccttcagg gcctggactg cactcacaga ggcaagtgtt gtagactaac aaagatactc 
1261 caaaatacaa tggcttaaag aatgtggtca tttattcttt attatttatt tatttgtggt 
1321 caaataaata aataaggtta tttattt 

SEQ ID NO: 173

Amino acid sequence of human LGALS12 variant ORF number 4 encoded by the DNA 
sequence shown in SEQ ID NO: 172. 

MSPGEKLDPIPDSFILQPPVFHPVVPYVTTIFGGLHAGKMVMLQGVVPLDAHRFQVDFQC
GCSLCPRPDIAFHFNPRFHTTKPHVICNTLHGGRWQREARWPHLALRRGSSFLILFLFGN
EEVKVSVNGQHFLHFRYRLPLSHVDTLGIFGDILVEAVGFLNINPFVEGSREYPAGHPFL
LMSPRLEVPCSHALPQGLSPGQVIIVRGLVLQEPKHFTVSLRDQAAHAPVTLRASFADRT
LAWISRWGQKKLISAPFLFYPQRFFEVLLLFQEGGLKLALNGQGLGATSMNQQALEQLRE
LRISGSVQLYCVHS

SEQ ID NO: 174

gi|11878244|gb|AF310687.1|AF310687 Homo sapiens galectin-12 splice form 2 mRNA,
complete cds, alternatively spliced 

1 agttggagtt gccccactgt catgtcacct ggagaaaaac tggacccaat tcctgacagc
5 61 ttcattctgc aaccaccagt cttccacccg gtggttcctt atgtcacgac gatttttgga 
121 ggcctgcatg caggcaagat ggtcatgctg caaggagtgg tccctctaga tgcacacagt 
181 aggtttcagg tggacttcca gtgtggctgc agcctgtgtc cccggccaga tatcgccttc 
241 cacttcaacc ctcgcttcca taccaccaag ccccatgtca tctgcaacac cctgcatggt 
301 ggacgctggc aaagggaggc ccggtggccc cacctggccc tgcgaagagg ctccagcttc 
10 361 ctcatcctct ttctcttcgg gaatgaggaa gtgaaggtga gtgtgaatgg acagcacttt 
421 ctccacttcc gctaccggct cccactgtct catgtggaca cgctgggtat atttggtgac 
481 atcctggtag aggctgttgg attcctgaac atcaatccat ttgtggaggg cagcagagag 
541 tacccagctg gacatccttt cctgctgatg agccccaggc tggaggtgcc ctgctcacat
601 gctcttcccc agggtctctc gcctgggcag gtcatcatag tacggggact ggtcttgcaa 
15 661 gagccgaagc attttactgt gagcctgagg gaccaggctg cccatgctcc tgtgacactc 
721 agggcctcct tcgcagacag aactctggcc tggatctccc gctgggggca gaagaaactg 
781 atctcagccc ccttcctctt ttacccccag agattctttg aggtgctgct cctgttccag 
841 gagggagggc tgaagctggc gctcaatggg caggggctgg gggccaccag catgaaccag 
901 caggccctgg agcagctgcg ggagctccgg atcagtggaa gtgtccagct ctactgtgtc 
20 961 cactcctgag gatggttcca gggaaatacc gccagaaaac aagaaggtca gcccactccc 
1021 agggccccac tctcctcccc tcattaaacc atccacctga caccagcaca tcaggcctgg 
1081 ttcacctctg gggtcacgag actgagtcta caggagcttt gggcctgagg gaaggcacaa 
1141 gagtgcaaag gttcctcgaa ctctgcacct tcctccacca ggagcctggg atatggctcc 
1201 atctgccttc agggcctgga ctgcactcac agaggcaagt gttgtagact aacaaagata 
25 1261 ctccaaaata caatggctta aagaatgtgg tcatttattc tttattattt atttatttgt 
1321 ggtcaaataa ataaataagg ttatttattt 

SEQ ID NO: 175

Amino acid sequence of human LGALS12 variant ORF number 5 encoded by the DNA 
sequence shown in SEQ ID NO: 174.

MSPGEKLDPIPDSFILQPPVFHPVVPYVTTIFGGLHAGKMVMLQGVVPLDAHSRFQVDFQ
CGCSLCPRPDIAFHFNPRFHTTKPHVICNTLHGGRWQREARWPHLALRRGSSFLILFLFG
NEEVKVSVNGQHFLHFRYRLPLSHVDTLGIFGDILVEAVGFLNINPFVEGSREYPAGHPF
LLMSPRLEVPCSHALPQGLSPGQVIIVRGLVLQEPKHFTVSLRDQAAHAPVTLRASFADR
TLAWISRWGQKKLISAPFLFYPQRFFEVLLLFQEGGLKLALNGQGLGATSMNQQALEQLR
ELRISGSVQLYCVHS

SEQ ID NO: 176 

gi|15010845|gb|AF244978.1|AF244978 Mus musculus galectin-12 mRNA, variant a,
complete cds, alternatively spliced 

1 cacgtgtacc accagcagca gcagcttccc ttcctactat ccgctgaagg cccagctgtt 
40 61 gcagaggctg ggaaaagtga gtccattccg tggggaatcg cgttgctgga atcaggatca 
121 gaggctgagt tgaagttgcc cctcccccat gtcaactgac gaacacctgg acccgattcc 
181 tgacagcttc atcctgcagc cgccagtctt ccacccggtg attccttatg gcacaacaat 
241 ttttggtggc ctgtatgcag gcaagatggt cacgctgcag ggtgtggtcc ctctgcatgc 
301 aaggaggttt caggtggact tccagtgtgg gtgctgcctg catcctcagc cagatgttgc
45 361 cttccgcttc agccctcgat tctacactgt caagccccat gtcatctgca acacccacca 
421 aggtggactc tggcaaaaag agatacggtg gccaggggtc gccctgcaga gaggggatag 
481 cttcctcatt ctctttctct ttgagaacga agaggtgaag gtgagtgtaa atggccagca 
541 ctttcttcac taccgctacc ggctcccact gtcacgggta gatacccttg acatatctgg 
601 tgacatcttg gtaaaggctg ttggattcct gaacatcaat ccatttgtgg agggtagcag 
661 agagtatcca gttggatatc ccttcctgct gtatagcccc aggctggagg tgccctgctc
721 acgtgctctt cctcggggtc tctggcctgg gcaagtcatt gtagttcgag gactggtctt
781 gaaagagccg aaagatttta ccctgagcct gaaggatggg accacccatg ctcctgtgac
841 actcagggct tccttcacag acagaacact ggcctgggtc tcctcctggg gacgaaagaa
901 gctgatctcc gcccccttcc tctttcaccc ccagcgattc ttcgaggtac tgcttctgtg
961 ccaagaggga gggctgaagt tggcactcaa tgggcagggg ctgggggcca ccagcctgga
1021 ccagaaagcc ctggagcagc tgcgggagct caggatcagt ggaaatgtcc acctctactg
1081 tgtccactgc taagaagggc tccaagggca cccccgccag agaagaggag gggcggctgc
1141 aaccagggct ccacagttac agcccattct cctattctca agggagaaac tgacctggca
1201 tcactttgtc aggccgagct cacatctcaa agggcgcgcg cacacacaca c



SEQ ID NO: 177

Amino acid sequence of mouse LGALS12 encoded by the DNA sequence shown in SEQ ID 
NO: 176. 

MSTDEHLDPIPDSFILQPPVFHPVIPYGTTIFGGLYAGKMVTLQGVVPLHARRFQVDFQC
GCCLHPQPDVAFRFSPRFYTVKPHVICNTHQGGLWQKEIRWPGVALQRGDSFLILFLFEN
EEVKVSVNGQHFLHYRYRLPLSRVDTLDISGDILVKAVGFLNINPFVEGSREYPVGYPFL
LYSPRLEVPCSRALPRGLWPGQVIVVRGLVLKEPKDFTLSLKDGTTHAPVTLRASFTDRT
LAWVSSWGRKKLISAPFLFHPQRFFEVLLLCQEGGLKLALNGQGLGATSLDQKALEQLRE
LRISGNVHLYCVHC



SEQ ID NO: 178 

gi|34861855|ref|XM_219545.2| Rattus norvegicus similar to galectin-12 (LOC293710), 
mRNA 

1 atgtcaactg acgaacacct ggacccgatc cctgacagct tcatcctgca gccgccagtc 
25 61 ttccaccctg tgggtgctga gcttgctgaa gggaaagaca tggctccagt tttcttatac 

121 agtgatcaga atggaaaaac agccgggcag atcctctata ctggggccat ggaccgagca 
181 agcattgtga ttccttatgt cacaacaatt ttcggtggcc tgtatgcagg caagatgatc 
241 atgctgcagg gtgtggtccc tcgccatgca cggaggtttc aggtggactt ccagtgtggg 
301 tgctgcctgc atcctcggcc agatgttgcc ttccacttca gccctcgctt ctacactgtc 
30 361 aagccccatg tcatctgcaa caccctccaa ggtggactct ggcagaaaga ggtccggtgg 
421 ccaggaatcg ccctgcagaa aggggctagc ttcctcatcc tctttctctt tgacaatgaa 
481 gaggtgaagg tgagtgtgaa cggacagcac tttcttcact accgctatcg gctcccactg 
541 tcacgggtgg atactcttga tatatctggc gacatcttgg taaaggctgt tggattcctg 
601 aacatcagtc cattcgtgga gggtagcaga gagtatccag ttggatatcc cctcctgctg 
35 661 tatagcccca ggctgtacca agcatcaggc caggaaaacc atccagcccc agacctccag 
721 ctgctcatcg gtggcagaat gacaaaccac tgtatacctg ctacatacag ttcagagcac 
781 actctggaac aaggacctgt gcacagccca gagagcctga agaagctctc agcaggctta 
841 catttcaccc tgagcctgag ggatggggcc acccatgttc ctgtgacact cagggcttcc 
901 ttcacagaca gaacactggc ctgggtctcc tcctggggac gaaagaagct gatctcagcc 
40 961 cccttcctct tttatcccca gcgattcttc gaggttctgc ttttgtgcca agagggaggg 
1021 ctgaagctgg cactcaatgg gcatgggctg ggggccacca gcctggatca gaaagccctg 
1081 gagcagctgc gggacctcag gatcagtgga aggctccaca gttacagccc caccctgcct 
1141 ttctcactgg agaacttgcc ctggcatcac tgtttcaggc ctagttcaca tctggtcact 
1201 caactagatt gtggagcctt tgcccaaagt gaacaaatat cacagctgct ttcatggttt 
45 1261 tggtcaaagg gaaccagcca catcccagaa aaacacactg ataatgaggc cacattctgg 
1321 tcacctaaca ataatcttgc ataa 

SEQ ID NO: 179

Amino acid sequence of rat LGALS12 encoded by the DNA sequence shown in SEQ ID NO: 
178. 
MSTDEHLDPIPDSFILQPPVFHPVGAELAEGKDMAPVFLYSDQNGKTAGQILYTGAMDRA
SIVIPYVTTIFGGLYAGKMIMLQGVVPRHARRFQVDFQCGCCLHPRPDVAFHFSPRFYTV
KPHVICNTLQGGLWQKEVRWPGIALQKGASFLILFLFDNEEVKVSVNGQHFLHYRYRLPL
SRVDTLDISGDILVKAVGFLNISPFVEGSREYPVGYPLLLYSPRLYQASGQENHPAPDLQ
LLIGGRMTNHCIPATYSSEHTLEQGPVHSPESLKKLSAGLHFTLSLRDGATHVPVTLRAS
FTDRTLAWVSSWGRKKLISAPFLFYPQRFFEVLLLCQEGGLKLALNGHGLGATSLDQKAL
EQLRDLRISGRLHSYSPTLPFSLENLPWHHCFRPSSHLVTQLDCGAFAQSEQISQLLSWF
WSKGTSHIPEKHTDNEATFWSPNNNLA

SEQ ID NO: 180 

gi|24475648|ref|NM_021077.2| Homo sapiens neuromedin B (NMB), mRNA

1 cgcgcgcccg aacgaagccg cggcccgggc acagccatgg cccggcgggc ggggggcgct 
61 cggatgttcg gcagcctcct gctcttcgcc ctgctcgctg ccggcgtcgc cccgctcagc 
121 tgggatctcc cggagccccg cagccgagcc agcaagatcc gagtgcactc gcgaggcaac 
181 ctctgggcca ccggtcactt catgggcaag aagagtctgg agccttccag cccatcccat 

15 241 tggggacagc tccccacacc tcccctgagg gaccagcgac tgcagctgag tcatgatctg 
301 ctcggaatcc tcctgctaaa gaaggctctg ggcgtgagcc tcagccgccc cgcaccccaa 
361 atccagtaca ggaggctgct ggtacaaata ctgcagaaat gacaccaata ataggggcag 
421 acacaacagc gtggcttaga ttgtgcccac ccagggaagg tgctgaatgg gaccctgttg 
481 atggccccat ctggatgtaa atcctgagct caaatctctg ttactccatt actgtgattt 

20 541 ctggctgggt caccagaaat atcgctgatg cagacacaga ttatgttcct gctgtatttc 
601 ctgcttccct gttgaattgg tgaataaaac cttgctcttt 

SEQ ID NO: 181 

Amino acid sequence of human NMB encoded by the DNA sequence shown in SEQ ID NO: 
180. 

MARRAGGARMFGSLLLFALLAAGVAPLSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKS
LEPSSPSHWGQLPTPPLRDQRLQLSHDLLGILLLKKALGVSLSRPAPQIQYRRLLVQILQ
K

SEQ ID NO: 182 

Amino acid sequence of human NMB, a soluble active secreted form derived from SEQ ID
NO: 181.

APLSWDLPEPRSRASKIRVHSRGNLWATGHFMGKKSLEPSSPSHWGQLPTPPLRDQRLQL 
SHDLLGILLLKKALGVSLSRPAPQIQYRRLLVQILQK

SEQ ID NO: 183 

gi|13386017|ref|NM_026523.1| Mus musculus neuromedin B (Nmb), mRNA

35 1 gatttggcgc gcttcgagaa ctagtatcct gagagtgcca gagaagagcc tgtttggcac 

61 agccatgacc cggcaagcag ggagctcttg gctcctccgt ggtctcctgc tcttcgcatt 
121 gttcgcttcc ggcgtcgctc ccttcaactg ggatctcccg gagccccgca gccgagcaag 
181 caagattcga gtgcaccctc ggggcaacct ctgggcgacc ggtcacttca tgggcaagaa
241 gagcctggaa cccccgagcc tgtcactggt ggggacagca ccccctaaca ccccgaggga 

40 301 ccagagacta cagctgagtc atgatctgct caggatcctc ctgcgaaaga aagctctagg 
361 catgaacttc agtggcccag ctcccccaat ccagtacagg aggctgctgg agccactact 
421 gcagaagtga tgccaataat ggaacaaacc ggatgctggg cttagaatgt gtccatccag 
481 ggaagctgac aatggaaccc tagcagtggc ctcctctgga tgtaaatcct aagctcaaat 
541 gtgttaatct gttactgtga tttctggttg ggtcaccagg aatgttaatg atgcagacac 
601 aaagtctttc ctgctgtatt tcttgcttcc ctggtgaagt ggtgaataaa aatactctct
661 tc 

SEQ ID NO: 184

Amino acid sequence of mouse NMB encoded by the DNA sequence shown in SEQ ID NO:
183.

MTRQAGSSWLLRGLLLFALFASGVAPFNWDLPEPRSRASKIRVHPRGNLWATGHFMGKKS
LEPPSLSLVGTAPPNTPRDQRLQLSHDLLRILLRKKALGMNFSGPAPPIQYRRLLEPLLQ 
K 

SEQ ID NO: 185

ENSRNOESTT00000034038 cDNA sequence, EnsEMBL transcript [Rattus norvegicus]

1 cagagaaaac ccgtttggca cagccatgac ccggcaagca gggaggactt ggctgctccg
61 tggtctcctg ctctttgcct tgttcgtttc cggcatcacc cccttttcct gggatctccc
121 ggagccccgc agccgagcaa gcaagattcg agtgcaccct cggggcaacc tctgggcgac
181 tggtcacttc atgggcaaga agagtctgga acccccgagc ctgtcactgg tgggtacagc
241 accccctatc acccagaggg agcagagact acagctgagt catgatctgc tcaggatcct
301 cctgctacag aaagcgctag gcatgaacct cagtggtcca gctcccccaa tccagtacag
361 gaggctgctg cagaagtgac gccaataatg gaacaaacac ggcactgggc gtagcatgcg
421 tccatccagg gaaactgagg aatggaaccc tagcagtggc ctcctctgga tgtgaaacct
481 aagctcaaat gtgttactct gttactgtga tttctggttg ggtcaccagg aatgttaacg
541 atacagacac agtctttcct gctgtatttc ttgcttccct ggtgaagttg tgaataaaaa
601 tactgctctt tacgaaga

SEQ ID NO: 186

Amino acid sequence of rat NMB encoded by the DNA sequence shown in SEQ ID NO: 185. 

MTRQAGSTWLLRGLLLFALFVSGITPFSWDLPEPRSRASKIRVHPRGNLWATGHFMGKKS 
LEPPSLSLVGTAPPITQREQRLQLSHDLLRILLLQKALGMNLSGPAPPIQYRRLLQK

SEQ ID NO: 187

gi|32307134|ref|NM_005386.2| Homo sapiens neuronatin (NNAT), transcript variant 1,
mRNA 

1 taggtggcgg gcgggtactt aaggcgcggc caccgcggct gcggcagtgc gcccaacagc 
30 61 ggactccgag accagcggat ctcggcaaac cctctttctc gaccacccac ctaccattct 
121 tggaaccatg gcggcagtgg cggcggcctc ggctgaactg ctcatcatcg gctggtacat 
181 cttccgcgtg ctgctgcagg tgttcctgga atgctgcatt tactgggtag gattcgcttt
241 tcgaaatcct ccagggacac agcccattgc gagaagtgag gtgttcaggt actccctgca 
301 gaagctggca tacacggtgt cgcggaccgg gcggcaggtg ttgggggagc gcaggcagcg 
35 361 agcccccaac tgaggcccca gctcccagcc ctgggcggcc gtatcatcag gtgctcctgt 
421 gcatctcggc cagcacggga gccagtgccg cgcaggaatg tggggtcccc tgtgttccct 
481 cgccagagga gcacttggca aggtcagtga ggggccagta gacccccgga gaagcagtac 
541 cgacaatgac gaagatacca gatcccttcc caaccccttt gcaccggtcc cactaagggg 
601 cagggtcgag agaggagggg ggataggggg agcagatccc tgagatctgg gcataggcac 
40 661 cgcattctga tctggacaaa gtcgggacag caccatccca gccccgaagc cagggccatg 
721 ccagcaggcc ccaccatgga aatcaaaaca ccgcaccagc cagcagaatg gacattctga 
781 catcgccagc cgacgccctg aatcttggtg cagcaccaac cgcgtgcctg tgtggcggga 
841 ctggagggca cagttgagga aggagggtgg ttaagaaata cagtggggcc ctctcgctgt 
901 cccttgccca gggcacttgc attccagcct cgctgcattt gctctctcga ttcccctttc 
961 ctcctcactg cctcccaagc ccaccctact ccaaaataat gtgtcacttg atttggaact
1021 attcaagcag taaaagtaaa tgaatcccac ctttactaaa acactttctc tgaacccccc
1081 ttgcccctca ctgatcttgc ttttccctgg tctcatgcag ttgtggtcaa tattgtggta
1141 atcgctaatt gtactgattg tttaagtgtg cattagttgt gtctccccag ctagattgta
1201 agctcctgga ggacagggac cacctctaca aaaaataaaa aaagtacctc ccctgtctcg
1261 cacagtgtcc caggaccctg cggtgcagta gaggcgcacc aaaaaaaaaa aaaaaaaaaa
1321 aaaaaaaaaa aaaaaaaa



SEQ ID NO: 188 



Amino acid sequence of human NNAT encoded by the DNA sequence shown in SEQ ID
NO: 187.



MAAVAAASAELLIIGWYIFRVLLQVFLECCIYWVGFAFRNPPGTQPIARSEVFRYSLQKL
AYTVSRTGRQVLGERRQRAPN 

SEQ ID NO: 189 

gi|32307135|ref|NM_181689.1| Homo sapiens neuronatin (NNAT), transcript variant 2,
mRNA

1 taggtggcgg gcgggtactt aaggcgcggc caccgcggct gcggcagtgc gcccaacagc 
61 ggactccgag accagcggat ctcggcaaac cctctttctc gaccacccac ctaccattct 
121 tggaaccatg gcggcagtgg cggcggcctc ggctgaactg ctcatcatcg gctggtacat 
181 cttccgcgtg ctgctgcagg tgttcaggta ctccctgcag aagctggcat acacggtgtc 
20 241 gcggaccggg cggcaggtgt tgggggagcg caggcagcga gcccccaact gaggccccag 
301 ctcccagccc tgggcggccg tatcatcagg tgctcctgtg catctcggcc agcacgggag 
361 ccagtgccgc gcaggaatgt ggggtcccct gtgttccctc gccagaggag cacttggcaa 
421 ggtcagtgag gggccagtag acccccggag aagcagtacc gacaatgacg aagataccag 
481 atcccttccc aacccctttg caccggtccc actaaggggc agggtcgaga gaggaggggg 
25 541 gataggggga gcagacccct gagatctggg cataggcacc gcattctgat ctggacaaag 
601 tcgggacagc accatcccag ccccgaagcc agggccatgc cagcaggccc caccatggaa 
661 atcaaaacac cgcaccagcc agcagaatgg acattctgac atcgccagcc gacgccctga 
721 atcttggtgc agcaccaacc gcgtgcctgt gtggcgggac tggagggcac agttgaggaa 
781 ggagggtggt taagaaatac agtggggccc tctcgctgtc ccttgcccag ggcacttgca 
30 841 ttccagcctc gctgcatttg ctctctcgat tcccctttcc tcctcactgc ctcccaagcc 
901 caccctactc caaaataatg tgtcacttga tttggaacta ttcaagcagt aaaagtaaat 
961 gaatcccacc tttactaaaa cactttctct gaacccccct tgcccctcac tgatcttgct 
1021 tttccctggt ctcatgcagt tgtggtcaat attgtggtaa tcgctaattg tactgattgt 
1081 ttaagtgtgc attagttgtg tctccccagc tagattgtaa gctcctggag gacagggacc 
35 1141 acctctacaa aaaataaaaa aagtacctcc cctgtctcgc acagtgtccc aggaccctgc 
1201 ggtgcagtag aggcgcacca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 



SEQ ID NO: 190 

Amino acid sequence of human NNAT variant ORF number 1 encoded by the DNA sequence
shown in SEQ ID NO: 189.

MAAVAAASAELLIIGWYIFRVLLQVFRYSLQKLAYTVSRTGRQVLGERRQRAPN

SEQ ID NO: 191

gi|6754863|ref|NM_010923.1| Mus musculus neuronatin (Nnat), transcript variant 1, mRNA

1 gcggact ccgagaccag tagacctcgg cgaacccttg ctctcgacca
61 cccacccact ttcggaacca tggccgcagt ggcagcagcc tcggcagaac tgctcatcat
121 cggctggtac atcttccgcg tgctgctgca ggtgttcctg gaatgctgca tttactgggt
181 aggattcgct tttcgaaatc ctccagggac acagcccatt gcgagaagtg aggtgttcag 
241 gtactccctg cagaagctgg cgcacacggt gtcccggacc gggcggcagg tgctggggga 
5 301 gcgcaggcag cgagccccca actgaggccc cagctcccag ccctgggcgg ccgtgtcatc 
361 aggtgctcct gtgcttctcg accagcatgg gagccagtgc cgcgcaggaa tggggggtcc 
421 cctgtgttcc ctcgtcagag gagcacttgc caaggtcagt gaggggccgg taggcccccc 
481 agaaaagcag caccgacaat gatgaagata tcagttccct tcccagcccc tttgcccctg
541 tcccactacc ggcgggtggg agaggagggg ggaagagggg agcaaccctc gagatatggg 
10 601 cgtaggcacc acattctgat ctggaccaag tcggaacagc accatctcag ccgcacaaga 
661 tcctaccatg aagatcgaac agcccatcaa ccagcagaat ggacattctg acatcaccag 
721 ctgaagccct acatctcggt gcagaagaga aagtgtcaac tgtgtgcagc atggggggag 
781 tggagggcgt gggtggtgga ggaagagggt taagaaaact agtggggccc ccttgctgcc 
841 ccttgcctat ggcacacata ttcctgcctt gctccctcat ttcccctttt ccccccgcct 
15 901 tcgaaagccc tccccaaaat gtgtcacttg atttggatat attcaaccag taattgaatc 
961 ccacctttac caaaacacgt tctctaaccc ccggcccttc actgatcttg cttatccctg 
1021 gtctcacgca gcagttgtgg tcaatattgt ggtagtcgct aattgtactg gtttaagtgt 
1081 gcattagtag tgtctcccca gctagattgt aagctcctgg agacagggac cacctccacc 
1141 aaaaataaaa aaatggacct ctcctgtctt gtagtgtcct aggaccctgc agggca 



SEQ ID NO: 192

Amino acid sequence of mouse NNAT encoded by the DNA sequence shown in SEQ ID NO:
191. 

MAAVAAASAELLIIGWYIFRVLLQVFLECCIYWVGFAFRNPPGTQPIARSEVFRYSLQKL
AHTVSRTGRQVLGERRQRAPN

SEQ ID NO: 193

gi|16758385|ref|NM_053601.1| Rattus norvegicus neuronatin (Nnat), transcript variant 1,
mRNA 

1 gcgaaccctt gctctcgacc acccacccac tttcggaacc atggccgcag tggcagcagc 

30 61 ctcggcagaa ctgctcatca tcggctggta catcttccgc gtgctgctgc aggtgttcct 
121 ggaatgctgc atttactggg taggattcgc ttttcgaaat cctccaggga cacagcccat 
181 tgcgagaagt gaggtgttca ggtactccct gcagaagctg gcgcacacgg tgtcccggac 
241 cgggcggcag gtgctggggg agcgcaggca ccgagccccc aactgaggcc ccatctccca 
301 gccctgggcg gccgtgtcat caggtgctcc tgtgcttctc gaccagcatg ggagccaatg 

35 361 ccgcgcagga atggggggtc ccctgtgctc cctcgtcaga ggagcacttg ccaaggtcag 
421 tgaggggccg gtaggtcccc agaaaagcag caccgacaat gatgaagaca tcagttcctt 
481 tcccagcccc cccccccttt gcccctgtcc catggccggc gggtgggaga ggatggggga 
541 agaggggagc aaccctcgag atatgggcgt aggcaccaca ttctgatctg gaccaagttg 
601 gaacagcacc atctcagccg cacagatcct accatggaga gctaacaccc caccaaccag 

40 661 cagaatggac attctgacat caccagctga aaccctgaat ctcggtgcag aagagaaagt 
721 gtcaactgcg tgcagcactg ggggagtgga gggtgtgggt ggtggaggaa gagggttaag 
781 aaaactagtg gggccctctt gctgtccctt gcctatggca cgcatattcc tgccttgctc 
841 cctcactccc cctctcccct gccttccaaa gccccacccc ccaaaaatgt gtcacttgat 
901 tcggacctat tcaaccagta attggatccc acctttacca aaacaccgtc tctgaccccc 

45 961 ggcccttcac tgatcttgct tatccctggt ctcacgcagc agttgtggtt gctattgtgg 
1021 tagtcgctaa ttgtactagt ttacgtgtgc attagttgtg tctccccggc tagattgtaa 
1081 gctcctggag acagggacca cctccacaaa aaataaaaaa acggacctcc cctgtcttgt 
1141 agtgtgctag gaccctgcag ggcagtgggg gtgcacca 

SEQ ID NO: 194

Amino acid sequence of rat NNAT encoded by the DNA sequence shown in SEQ ID NO:
193.

MAAVAAASAELLIIGWYIFRVLLQVFLECCIYWVGFAFRNPPGTQPIARSEVFRYSLQKL
AHTVSRTGRQVLGERRHRAPN 

SEQ ID NO: 195

gi|17157992|ref|NM_058164.1| Homo sapiens olfactomedin 2 (OLFM2), mRNA

1 ctaggtcggg acgccgggct aggggcgcgt catgtggccg ctcacggtcc cgccgccgct 
61 gctgctgctg ctgtgctcag gcctggccgg acagactctc ttccagaacc cagaagaggg 
121 ctggcagctg tacacctcag cccaggcccc tgacgggaaa tgcatctgca cggccgtgat 
10 181 cccagcgcag agtacctgct ctcgagatgg caggagtcgg gagctgcggc aactgatgga 
241 gaaggtccag aacgtctccc agtccatgga ggtccttgag ttgcggacgt atcgcgacct 
301 ccagtatgta cgcggcatgg agaccctcat gcggagcctg gatgcgcggc tccgggcagc 
361 tgatgggtcc ctctcggcca agagcttcca ggagctgaag gacaggatga cggaactgtt 
421 gcccctgagc tcggtcctgg agcagtacaa ggcagacacg cggaccattg tacgcttgcg 
15 481 ggaggaggtg aggaatctct ccggcagtct ggcggccatt caggaggaga tgggtgccta 
541 cgggtatgag gacctgcagc aacgggtgat ggccctggag gcccggctcc acgcctgcgc 
601 ccagaagctg ggctgtggga agctgaccgg ggtcagtaac cccatcaccg ttcgggccat 
661 ggggtcccgc ttcggctcct ggatgactga cacgatggcc cccagtgcgg atagccgggt 
721 ctggtacatg gatggctatt acaaaggccg ccgggtcctg gagttccgta ccctgggaga 
20 781 cttcatcaaa ggccagaact ttatccagca cctgctgccc cagccgtggg cgggcacggg 
841 ccacgtggtg tacaacggct ccctgttcta taacaagtac cagagcaacg tggtggtcaa 
901 ataccacttc cgctcgcgct ctgtgctggt gcagaggagc ctcccgggcg ccggttacaa 
961 caacaccttc ccctactcct ggggcggctt ctccgacatg gacttcatgg tggacgagag 
1021 cgggctctgg gctgtgtaca ccaccaacca gaacgcgggc aacatcgtgg tcagccggct 
25 1081 ggacccgcac accctcgagg tcatgcggtc ctgggacacc ggctacccca agcgcagcgc 
1141 tggcgaggcc ttcatgatct gcggtgtgct ctacgtgacc aactcccacc tggctggggc 
1201 caaggtctac ttcgcctatt ttaccaacac gtccagttac gagtacacgg acgtgccctt 
1261 ccacaaccag tattcccaca tctcgatgct ggattacaac ccccgggagc gcgccctcta 
1321 tacctggaac aacggccacc aggtgctcta caatgtcacc ctgtttcacg tcatcagcac 
30 1381 ctctggggac ccctgagcca atgctgtggc tcgggctgct gcctgggggg cctccggggg 
1441 ctgggggccc ttttcattct gcctgtgtcc ctcaagggtg atctctctgt ctctgtcacg 
1501 ccctttctcc ccgccttttt gctgggcttt tgttctctgc ctatgtattt ctgtctattt 
1561 tttcaatttc ccctcttctc ctttattgat ctctgctttt aatacaccac ttctttcttt 
1621 ctgccttttt atggatgtct ttttcttttt atggctctgg ttctccagtt ctttccgtct 
1681 ctgcctctct ctgtctctct ctctctgtcc ttccacccct ccctccttgc ttcccaccca
1741 ttcctcatcc ctcactccca cccccacccc cacccccagg agttgagtgc atggatctgt 
1801 ttcttttttt atttacactt tttctttccg gtttgccgga ataaacagga cctttgacat 
1861 ttgaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 

SEQ ID NO: 196 

Amino acid sequence of human OLFM2 encoded by the DNA sequence shown in SEQ ID
NO: 195.

MWPLTVPPPLLLLLCSGLAGQTLFQNPEEGWQLYTSAQAPDGKCICTAVIPAQSTCSRDG
RSRELRQLMEKVQNVSQSMEVLELRTYRDLQYVRGMETLMRSLDARLRAADGSLSAKSFQ
ELKDRMTELLPLSSVLEQYKADTRTIVRLREEVRNLSGSLAAIQEEMGAYGYEDLQQRVM
ALEARLHACAQKLGCGKLTGVSNPITVRAMGSRFGSWMTDTMAPSADSRVWYMDGYYKGR
RVLEFRTLGDFIKGQNFIQHLLPQPWAGTGHVVYNGSLFYNKYQSNVVVKYHFRSRSVLV
QRSLPGAGYNNTFPYSWGGFSDMDFMVDESGLWAVYTTNQNAGNIVVSRLDPHTLEVMRS
WDTGYPKRSAGEAFMICGVLYVTNSHLAGAKVYFAYFTNTSSYEYTDVPFHNQYSHISML
DYNPRERALYTWNNGHQVLYNVTLFHVISTSGDP

SEQ ID NO: 197

Amino acid sequence of human OLFM2, a soluble active secreted form derived from SEQ ID
NO: 196.

QTLFQNPEEGWQLYTSAQAPDGKCICTAVIPAQSTCSRDGRSRELRQLMEKVQNVSQSME
VLELRTYRDLQYVRGMETLMRSLDARLRAADGSLSAKSFQELKDRMTELLPLSSVLEQYK
ADTRTIVRLREEVRNLSGSLAAIQEEMGAYGYEDLQQRVMALEARLHACAQKLGCGKLTG
VSNPITVRAMGSRFGSWMTDTMAPSADSRVWYMDGYYKGRRVLEFRTLGDFIKGQNFIQH
LLPQPWAGTGHVVYNGSLFYNKYQSNVVVKYHFRSRSVLVQRSLPGAGYNNTFPYSWGGF
SDMDFMVDESGLWAVYTTNQNAGNIVVSRLDPHTLEVMRSWDTGYPKRSAGEAFMICGVL
YVTNSHLAGAKVYFAYFTNTSSYEYTDVPFHNQYSHISMLDYNPRERALYTWNNGHQVLY
NVTLFHVISTSGDP

SEQ ID NO: 198 

gi|31343362|ref|NM_173777.2| Mus musculus hypothetical protein A030009A06
(A030009A06), mRNA

1 gagttgagcg gcaggctacc tctggcttct tacacatacc tgacccatca ggcccccaga

61 accactttac agatgaggaa actgagacag accggaacaa ctattgctgg aggtcagact 
121 ctcttccaga gcccggagga gggctggcag ctttatacgt cagcccaggc acctgatggc 
181 aagtgcgtct gcacagccgt gatccctgcg cagagcacct gtgcccgaga cggtcggagc
241 agagagcttc ggcaactcat ggagaaggtc cagaatgtgt cccagtccat ggaggtcctt 
301 gagctaagga cattccggga tctccagtat gttcgcagca tggagaccct catgcggagc
361 ctggatgcaa ggctcagggc agccgatggg tcagtctcag ccaaaagctt tcaggaactg 
421 aaggacagga tgacagagct gctgcccctg agttcagtgc tggagcagta caaagcagac 
481 acacgaacca ttgtgcgcct gcgggaggag gtgaggaacc tctctggcaa cttggctgcc 
541 atccaagagg aaatgggtgc ctacgggtac gaggacttgc aacagcgcgt gatggccctg 
601 gaagcccgac tccatgcctg cgcgcagaag ctgggctgcg ggaagctgac aggcgtcagt
661 aaccctatta ccatccgggc catggggtcc cgcttcggtt cctggatgac tgacacaatg 
721 gcccccagtg cagacagcag ggtctggtac atggatggtt attacaaagg ccgccgagtg 
781 ctggagttcc gtactctggg agacttcatc aaaggccaga acttcatcca gcacctgctg 
841 ccacagccat gggcaggtac gggccatgtg gtatacaatg gctctctctt ctacaataag 
901 taccagagca atgtggtggt caagtaccac ttccggtccc gctcggtgct ggtgcagagg
961 agcctcccgg gggctggtta caataacacc ttcccctatt cctggggcgg cttctcggac 
1021 atggacttca tggtagacga gagtgggctg tgggctgtgt ataccaccaa ccagaatgcg 
1081 ggcaacatcg tagtcagtcg gctggaccct cacaccctgg aggtcgtgag atcctgggac 
1141 accgggtacc ctaagcgcag cgccggcgag gccttcatga tctgcggtgt cctctatgtg 
1201 accaactctc acctggccgg agccaaggtc tactttgcgt acttcaccaa cacgtccagc
1261 tatgagtaca cggatgtgcc cttcaacaac cagtattcgc acatctcgat gctggattac 
1321 aaccccaggg agcgggccct gtatacctgg aacaacgggc accaggtgct gtacaacgtc 
1381 accctgttcc acgtcatcag cactgccggg gacccctagg tgcccctgca agggctttgg 
1441 ggagccttcc cacatcgcct gtgaccccca ccccagcctt ctcttggtca tgcccttgcc 
1501 ttcctagatt ccgtctcccc acttccccag cccagctttc tgttctcagt atctctaccc
1561 atgcatttcc cccattttat tgatctctgc ttttgatact ccacttctga gtctttctgc 
1621 ctttttatgg atgcttctct tccttattga tcgaacctct cttctcctct ccctccatct 
1681 actctccctc ctcttccttt cccactttcc cacattcctt acccctcact cccaccctat 
1741 ccctggatct gagtgcatgg attttgtttt ttaaatttat tatttacatg ttttctttcc 
1801 gggttgccag aataaaccgg accttt

SEQ ID NO: 199

Amino acid sequence of mouse OLFM2 encoded by the DNA sequence shown in SEQ ID 
NO: 198. 

MRKLRQTGTTIAGGQTLFQSPEEGWQLYTSAQAPDGKCVCTAVIPAQSTCARDGRSRELR
QLMEKVQNVSQSMEVLELRTFRDLQYVRSMETLMRSLDARLRAADGSVSAKSFQELKDRM
TELLPLSSVLEQYKADTRTIVRLREEVRNLSGNLAAIQEEMGAYGYEDLQQRVMALEARL
HACAQKLGCGKLTGVSNPITIRAMGSRFGSWMTDTMAPSADSRVWYMDGYYKGRRVLEFR
TLGDFIKGQNFIQHLLPQPWAGTGHVVYNGSLFYNKYQSNVVVKYHFRSRSVLVQRSLPG
AGYNNTFPYSWGGFSDMDFMVDESGLWAVYTTNQNAGNIVVSRLDPHTLEVVRSWDTGYP
KRSAGEAFMICGVLYVTNSHLAGAKVYFAYFTNTSSYEYTDVPFNNQYSHISMLDYNPRE
RALYTWNNGHQVLYNVTLFHVISTAGDP



SEQ ID NO: 200

gi|34860388|ref|XM_233742.2| Rattus norvegicus similar to olfactomedin 2; neuronal
olfactomedin related ER localized protein 2; noelin 2 (LOC313783), mRNA

1 ccccatcccc atcccaccgg ccaccgcccc cttggacccg gtcatctcgc catcgggacc
61 ccgacccctc caccccactg gggtggatag cgacatcttg agtgccgagg aaaggggacg
121 gaacacctgc tcttttaggg tggtggtgcg gggcatgagg cgaggggcgc gatgtcggtg
181 ccgctgctca agatcggggc ggtgctgagc accatggcca tggtcactaa ctggatgtcg
241 cagacgctgc cctcgctcgt gggactcaac agcaccgtgt cccgcgcggg ctcctcagag
301 aaaatcactc tcttccagag cccagaagag ggctggcagc tgtacacgtc agcgcaggcg
361 ccggatggca aatgcatctg cacagctgtg attcctgcac agagcacctg tgcccgagat
421 ggtcggagca gagagcttcg ccagctcatg gagaaggtcc agaatgtgtc tcagtccatg
481 gaagtccttg agctaaggac ataccgggat ctccagtatg ttcgcagcat ggagaccctc
541 atgcggagcc tggatgcaag gctcaggaca gccgatgggt cagtctcagc caaaagcttt
601 caggaactga aggacaggat gacagagctg ctgcccctga gttcagtgct ggagcagtac
661 aaagcagaca cacggaccat tgtgcgcctg cgggaggagg tgaggaacct ctcaggcaac
721 ctggctgcca tccaggagga aatgggtgcc tacgggtacg aggacttgca gcagcgcgtg
781 atggccctgg aggcccgact ccatgcctgc gcgcagaagc tgggctgcgg gaagctgaca
841 ggcgtcagta accccattac catccgagcc atgggatccc gcttcggctc ctggatgact
901 gacacgatgg cccccagtgc agacagtcgg gtgagtgacc ccactccagt gactgcctgc
961 aggtgttcag tgaacattct tagcccagtg tccagtgaat ctgtgatccc agtctggtac
1021 atggatggtt attacaaagg ccgccgagtg ctggagttcc gtactttggg agacttcatc
1081 aaaggccaga acttcatcca gcacctgctg ccgcagccat gggcaggtac gggccatgtg
1141 gtatacaatg gatctctgtt ctacaacaag taccagagca atgtggtggt taagtaccac
1201 ttccggtccc gctccgtgtt ggtgcagagg agcctcccgg gggctggtta caataacacc
1261 ttcccctatt cctggggcgg cttctcggac atggacttca tggtagacga gagcgggctg
1321 tgggctgtgt ataccaccaa ccagaacgcg ggcaacattg tggtcagccg gctggaccct
1381 cacaccctgg aggtcgtgag gtcctgggac actgggtacc ctaagcgcag cgccggtgag
1441 gccttcatga tctgtggtgt cctctacgtg accaactctc acctggccgg agccaaggtc
1501 tactttgcgt acttcaccaa cacgtccagc tatgagtaca cggatgtgcc cttccacaac
1561 cagtactcgc acatctccat gctggattac aaccccaggg agcgggccct gtatacctgg
1621 aacaacgggc accaggtgct gtacaacgtc accctcttcc acgtcatcag cactgccggg
1681 gacccctag

SEQ ID NO: 201

Amino acid sequence of rat OLFM2 encoded by the DNA sequence shown in SEQ ID NO: 
200. 



MSVPLLKIGAVLSTMAMVTNWMSQTLPSLVGLNSTVSRAGSSEKITLFQSPEEGWQLYTS
AQAPDGKCICTAVIPAQSTCARDGRSRELRQLMEKVQNVSQSMEVLELRTYRDLQYVRSM
ETLMRSLDARLRTADGSVSAKSFQELKDRMTELLPLSSVLEQYKADTRTIVRLREEVRNL
SGNLAAIQEEMGAYGYEDLQQRVMALEARLHACAQKLGCGKLTGVSNPITIRAMGSRFGS
WMTDTMAPSADSRVSDPTPVTACRCSVNILSPVSSESVIPVWYMDGYYKGRRVLEFRTLG
DFIKGQNFIQHLLPQPWAGTGHVVYNGSLFYNKYQSNVVVKYHFRSRSVLVQRSLPGAGY
NNTFPYSWGGFSDMDFMVDESGLWAVYTTNQNAGNIVVSRLDPHTLEVVRSWDTGYPKRS
AGEAFMICGVLYVTNSHLAGAKVYFAYFTNTSSYEYTDVPFHNQYSHISMLDYNPRERAL
YTWNNGHQVLYNVTLFHVISTAGDP

SEQ ID NO: 202

gi|7657070|ref|NM_014322.1| Homo sapiens opsin 3 (encephalopsin, panopsin) (OPN3),
mRNA

1 cgagccccgc cgcaagctga gcgcctccgc ccgccaggcg cgccggcgcc gggccatgta 
61 ctcggggaac cgcagcggcg gccacggcta ctgggacggc ggcggggccg cgggcgctga
121 ggggccggcg ccggcgggga cactgagccc cgcgcccctc ttcagccccg gcacctacga 
181 gcgcctggcg ctgctgctgg gctccattgg gctgctgggc gtcggcaaca acctgctggt 
241 gctcgtcctc tactacaagt tccagcggct ccgcactccc actcacctcc tcctggtcaa 
301 catcagcctc agcgacctgc tggtgtccct cttcggggtc acctttacct tcgtgtcctg 
361 cctgaggaac ggctgggtgt gggacaccgt gggctgcgtg tgggacgggt ttagcggcag
421 cctcttcggg attgtttcca ttgccaccct aaccgtgctg gcctatgaac gttacattcg 
481 cgtggtccat gccagagtga tcaatttttc ctgggcctgg agggccatta cctacatctg 
541 gctctactca ctggcgtggg caggagcacc tctcctggga tggaacaggt acatcctgga 
601 cgtacacgga ctaggctgca ctgtggactg gaaatccaag gatgccaacg attcctcctt 
661 tgtgcttttc ttatttcttg gctgcctggt ggtgcccctg ggtgtcatag cccattgcta
721 tggccatatt ctatattcca ttcgaatgct tcgttgtgtg gaagatcttc agacaattca 
781 agtgatcaag attttaaaat atgaaaagaa actggccaaa atgtgctttt taatgatatt 
841 caccttcctg gtctgttgga tgccttatat cgtgatctgc ttcttggtgg ttaatggtca 
901 tggtcacctg gtcactccaa caatatctat tgtttcgtac ctctttgcta aatcgaacac 
961 tgtatacaat ccagtgattt atgtcttcat gatcagaaag tttcgaagat cccttttgca
1021 gcttctgtgc ctccgactgc tgaggtgcca gaggcctgct aaagacctac cagcagctgg 
1081 aagtgaaatg cagatcagac ccattgtgat gtcacagaaa gatggggaca ggccaaagaa 
1141 aaaagtgact ttcaactctt cttccatcat ttttatcatc accagtgatg aatcactgtc 
1201 agttgacgac agcgacaaaa ccattggggt ccaaagtttg atgttaatcc aagttcgtcc 
1261 tttgtaggaa tgaaggatgg caacgaaagg tggggcctta aattggatgc cacttttgga
1321 ctttcatcat cctcctgaag aagaagtgtc tggaataccc gttctatgta atatcaacag 
1381 aaccttgtgg tccagcagga aatccgaatt gcccatatgc tcttgggcct caggaagagg 
1441 ttgaacaaaa acaaattctt ttaattcaac gggtgcttta cataatgaaa aaaccacttg 
1501 tgcacacgat gggcatctaa catcatcatc ttctaatgtg ttggagattt tcatttcaaa 
1561 tatatttttt aaattactct attttccaaa acacgtaatg catttttctc gaaaatacct
1621 tactgtaaaa ataactgtcg cgtacacatg tgtgaagtag ctagaacata ctgaattttt 
1681 tttgtactgt tggactctat tcagtgtcat gtcctatatc tgatcaagtt atcaaggaga 
1741 taattctaga atgaaaaaga aaatcctctt gttggaaaca aaagacgttt tatatgtgca 
1801 gtatgacaaa gaggagtttc agagacaact ttgaatcctt gtcagcctgg agaccagcac 
1861 cagaggaatc tacaaggcaa actcccatat atttgcttcc cccaaattgc tgcccctaca
1921 gactcaaagc tctttttctt tgttttgttg tttctctaaa aatttactgt tctttgtcga 
1981 tgctatataa gccagggagt tctaagacgc cagctctttg agatttgctc attcccctgt 
2041 atttcccaca tatatattac atatacccgc taataaattt atgtttgttt taaaaaaaaa 
2101 aaaaaaaaaa 

SEQ ID NO: 203

Amino acid sequence of human OPN3 encoded by the DNA sequence shown in SEQ ID NO:
202.

MYSGNRSGGHGYWDGGGAAGAEGPAPAGTLSPAPLFSPGTYERLALLLGSIGLLGVGNNL
LVLVLYYKFQRLRTPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVGCVWDGFS
GSLFGIVSIATLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWAGAPLLGWNRYI
LDVHGLGCTVDWKSKDANDSSFVLFLFLGCLVVPLGVIAHCYGHILYSIRMLRCVEDLQT
IQVIKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLVTPTISIVSYLFAKS
NTVYNPVIYVFMIRKFRRSLLQLLCLRLLRCQRPAKDLPAAGSEMQIRPIVMSQKDGDRP
KKKVTFNSSSIIFIITSDESLSVDDSDKTIGVQSLMLIQVRPL

SEQ ID NO: 204

(gi|27463268:201-667, 24621-24703, 42570-42821, 45876-47409) Homo sapiens
encephalopsin splice variant 1 (OPN3) gene, alternatively spliced; and choroideremia-like 
protein (CHML) gene, complete cds 

1 gcggcggagc cccaacccca cccagtgcgg agcgcgccgc gagccccgcc gcaagctgag 
61 cgcctccgcc cgccaggcgc gccggcgccg ggccatgtac tcggggaacc gcagcggcgg
121 ccacggctac tgggacggcg gcggggccgc gggcgctgag gggccggcgc cggcggggac 
181 actgagcccc gcgcccctct tcagccccgg cacctacgag cgcctggcgc tgctgctggg 
241 ctccattggg ctgctgggcg tcggcaacaa cctgctggtg ctcgtcctct actacaagtt 
301 ccagcggctc cgcactccca ctcacctcct cctggtcaac atcagcctca gcgacctgct 
361 ggtgtccctc ttcggggtca cctttacctt cgtgtcctgc ctgaggaacg gctgggtgtg
421 ggacaccgtg ggctgcgtgt gggacgggtt tagcggcagc ctcttcggtt ggatcagcca 
481 gcttcaggca gccactaggg aagccagagc ctccatgggt ccagtgcagc aaggcactat 
541 ctgcatgcag cttcgttgtg tggaagatct tcagacaatt caagtgatca agattttaaa 
601 atatgaaaag aaactggcca aaatgtgctt tttaatgata ttcaccttcc tggtctgttg 
661 gatgccttat atcgtgatct gcttcttggt ggttaatggt catggtcacc tggtcactcc
721 aacaatatct attgtttcgt acctctttgc taaatcgaac actgtataca atccagtgat 
781 ttatgtcttc atgatcagaa agtttcgaag atcccttttg cagcttctgt gcctccgact 
841 gctgaggtgc cagaggcctg ctaaagacct accagcagct ggaagtgaaa tgcagatcag 
901 acccattgtg atgtcacaga aagatgggga caggccaaag aaaaaagtga ctttcaactc 
961 ttcttccatc atttttatca tcaccagtga tgaatcactg tcagttgacg acagcgacaa
1021 aaccaatggg tccaaagttg atgtaatcca agttcgtcct ttgtaggaat gaagaatggc 
1081 aacgaaagat ggggccttaa attggatgcc acttttggac tttcatcata agaagtgtct 
1141 ggaatacccg ttctatgtaa tatcaacaga accttgtggt ccagcaggaa atccgaattg 
1201 cccatatgct cttgggcctc aggaagaggt tgaacaaaaa caaattcttt taattcaacg 
1261 ggtgctttac ataatgaaaa aaccacttgt ggcacacgat gggcatctaa catcatcatc
1321 ttctaatgtg ttggagattt tcatttcaaa tatatttttt aaattactct attttccaaa 
1381 acacgtaatg catttttctc gaaaatacct tactgtaaaa ataactgtcg cgtacacatg 
1441 tgtgaagtag ctagaacata ctgaattttt tttgtactgt tggactctat tcagtgtcat 
1501 gtcctatatc tgatcaagtt atcaaggaga taattctaga atgaaaaaga aaatcctctt 
1561 gttggaaaca aaagacgttt tatatgtgca gtatgacaaa gaggagtttc agagacaact
1621 ttgaatcctt gtcagcctgg agaccagcac cagaggaatc tacaaggcaa actcccatat 
1681 atttgcttcc cccaaattgc tgcccctaca gactcaaagc tctttttctt tgttttgttg 
1741 tttctctaaa aatttactgt tctttgtcga tgctatataa gccagggagt tctaagacgc 
1801 cagctctttg agatttgctc attcccctgt atttcccaca tatatattac atatacccgc 
1861 taataaattt atgtttgttt ttctcttgtc aatctgtctt ttgttatagg ggccccagcc
1921 aaggaaccta aagtgggtag aaggaaaaat tattttttct ttccctacaa actgaacatg 
1981 gattattaga actcaaggtt ttcattgaca atatagaaaa gaaacactga atcattttat 
2041 tttattgccc aatttttatt tcttatatga ctctagtgtt tcatcttcat aattaatcat
2101 gtttgaagga tttctgagtg actcagcagc ctgttaaaga aggatgaacc aaagaaaaca
2161 tttcactaaa tgtgctttta aaaatcaagt gtattgctgg ttctgctgca gtatgtagtc
2221 gaagaataaa ttagtaaatt gcttctgagg gtctgaaatt gaataaagta atggctttgt 
2281 atttctataa aagttgtctc cccttgtttc ctttccattc tggcacatgt agacat 



SEQ ID NO: 205

Amino acid sequence of human OPN3 variant ORF number 1 encoded by the DNA sequence
shown in SEQ ID NO: 204.

MYSGNRSGGHGYWDGGGAAGAEGPAPAGTLSPAPLFSPGTYERLALLLGSIGLLGVGNNL
LVLVLYYKFQRLRTPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVGCVWDGFS
GSLFGWISQLQAATREARASMGPVQQGTICMQLRCVEDLQTIQVIKILKYEKKLAKMCFL
MIFTFLVCWMPYIVICFLVVNGHGHLVTPTISIVSYLFAKSNTVYNPVIYVFMIRKFRRS
LLQLLCLRLLRCQRPAKDLPAAGSEMQIRPIVMSQKDGDRPKKKVTFNSSSIIFIITSDE
SLSVDDSDKTNGSKVDVIQVRPL

SEQ ID NO: 206

(gi|27463268:201-667, 35987-36306, 42570-42821, 45876-47409) Homo sapiens
encephalopsin splice variant 1 (OPN3) gene, alternatively spliced; and choroideremia-like 
protein (CHML) gene, complete cds 

1 gcggcggagc cccaacccca cccagtgcgg agcgcgccgc gagccccgcc gcaagctgag 
61 cgcctccgcc cgccaggcgc gccggcgccg ggccatgtac tcggggaacc gcagcggcgg
121 ccacggctac tgggacggcg gcggggccgc gggcgctgag gggccggcgc cggcggggac 
181 actgagcccc gcgcccctct tcagccccgg cacctacgag cgcctggcgc tgctgctggg 
241 ctccattggg ctgctgggcg tcggcaacaa cctgctggtg ctcgtcctct actacaagtt 
301 ccagcggctc cgcactccca ctcacctcct cctggtcaac atcagcctca gcgacctgct 
361 ggtgtccctc ttcggggtca cctttacctt cgtgtcctgc ctgaggaacg gctgggtgtg
421 ggacaccgtg ggctgcgtgt gggacgggtt tagcggcagc ctcttcggga ttgtttccat 
481 tgccacccta accgtgctgg cctatgaacg ttacattcgc gtggtccatg ccagagtgat 
541 caatttttcc tgggcctgga gggccattac ctacatctgg ctctactcac tggcgtgggc 
601 aggagcacct ctcctgggat ggaacaggta catcctggac gtacacggac taggctgcac 
661 tgtggactgg aaatccaagg atgccaacga ttcctccttt gtgcttttct tatttcttgg
721 ctgcctggtg gtgcccctgg gtgtcatagc ccattgctat ggccatattc tatattccat 
781 tcgaatgctt cgttgtgtgg aagatcttca gacaattcaa gtgatcaaga ttttaaaata 
841 tgaaaagaaa ctggccaaaa tgtgcttttt aatgatattc accttcctgg tctgttggat 
901 gccttatatc gtgatctgct tcttggtggt taatggtcat ggtcacctgg tcactccaac 
961 aatatctatt gtttcgtacc tctttgctaa atcgaacact gtatacaatc cagtgattta
1021 tgtcttcatg atcagaaagt ttcgaagatc ccttttgcag cttctgtgcc tccgactgct 
1081 gaggtgccag aggcctgcta aagacctacc agcagctgga agtgaaatgc agatcagacc 
1141 cattgtgatg tcacagaaag atggggacag gccaaagaaa aaagtgactt tcaactcttc 
1201 ttccatcatt tttatcatca ccagtgatga atcactgtca gttgacgaca gcgacaaaac 
1261 caatgggtcc aaagttgatg taatccaagt tcgtcctttg taggaatgaa gaatggcaac
1321 gaaagatggg gccttaaatt ggatgccact tttggacttt catcataaga agtgtctgga 
1381 atacccgttc tatgtaatat caacagaacc ttgtggtcca gcaggaaatc cgaattgccc 
1441 atatgctctt gggcctcagg aagaggttga acaaaaacaa attcttttaa ttcaacgggt 
1501 gctttacata atgaaaaaac cacttgtggc acacgatggg catctaacat catcatcttc 
1561 taatgtgttg gagattttca tttcaaatat attttttaaa ttactctatt ttccaaaaca
1621 cgtaatgcat ttttctcgaa aataccttac tgtaaaaata actgtcgcgt acacatgtgt 
1681 gaagtagcta gaacatactg aatttttttt gtactgttgg actctattca gtgtcatgtc 
1741 ctatatctga tcaagttatc aaggagataa ttctagaatg aaaaagaaaa tcctcttgtt 
1801 ggaaacaaaa gacgttttat atgtgcagta tgacaaagag gagtttcaga gacaactttg 
1861 aatccttgtc agcctggaga ccagcaccag aggaatctac aaggcaaact cccatatatt
1921 tgcttccccc aaattgctgc ccctacagac tcaaagctct ttttctttgt tttgttgttt 
1981 ctctaaaaat ttactgttct ttgtcgatgc tatataagcc agggagttct aagacgccag 
2041 ctctttgaga tttgctcatt cccctgtatt tcccacatat atattacata tacccgctaa 
2101 taaatttatg tttgtttttc tcttgtcaat ctgtcttttg ttataggggc cccagccaag 
2161 gaacctaaag tgggtagaag gaaaaattat tttttctttc cctacaaact gaacatggat
2221 tattagaact caaggttttc attgacaata tagaaaagaa acactgaatc attttatttt 
2281 attgcccaat ttttatttct tatatgactc tagtgtttca tcttcataat taatcatgtt 
2341 tgaaggattt ctgagtgact cagcagcctg ttaaagaagg atgaaccaaa gaaaacattt 
2401 cactaaatgt gcttttaaaa atcaagtgta ttgctggttc tgctgcagta tgtagtcgaa 
2461 gaataaatta gtaaattgct tctgagggtc tgaaattgaa taaagtaatg gctttgtatt
2521 tctataaaag ttgtctcccc ttgtttcctt tccattctgg cacatgtaga cat 



SEQ ID NO: 207

Amino acid sequence of human OPN3 variant ORF number 2 encoded by the DNA sequence
shown in SEQ ID NO: 206.

MYSGNRSGGHGYWDGGGAAGAEGPAPAGTLSPAPLFSPGTYERLALLLGSIGLLGVGNNL
LVLVLYYKFQRLRTPTHLLLVNISLSDLLVSLFGVTFTFVSCLRNGWVWDTVGCVWDGFS
GSLFGIVSIATLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWAGAPLLGWNRYI
LDVHGLGCTVDWKSKDANDSSFVLFLFLGCLVVPLGVIAHCYGHILYSIRMLRCVEDLQT
IQVIKILKYEKKLAKMCFLMIFTFLVCWMPYIVICFLVVNGHGHLVTPTISIVSYLFAKS
NTVYNPVIYVFMIRKFRRSLLQLLCLRLLRCQRPAKDLPAAGSEMQIRPIVMSQKDGDRP
KKKVTFNSSSIIFIITSDESLSVDDSDKTNGSKVDVIQVRPL

SEQ ID NO: 208

gi|6753709|ref|NM_010098.1| Mus musculus opsin (encephalopsin) (Opn3), mRNA

1 tcgggccgcc cggctcaccg agccctctct ctcacggcgc gccggcgtcg ggccatgtac

61 tcggggaacc gtagtggcga ccagggctac tgggaggacg gggcgggcgc cgagggcgca 
121 gcaccggcgg gcacgcggag ccccgcgcct ctcttcagcc ccaccgcgta cgagcgcctg 
181 gcgctgctac tcggctgcct cgcgctgctg ggcgtcggcg gcaacctgct ggtgctgctt 
241 ctctactcca agttcccgag actgcgcacg cccacccacc tcttcctggt caacctgagc 
301 ctgggcgacc tgctggtatc cctgttcgga gtcaccttca ccttcgcgtc gtgcctgcgg
361 aacggctggg tgtgggacgc cgtgggctgc gcgtgggacg ggtttagcgg cagcctcttt 
421 gggttcgttt ccattaccac cctcactgtg ctggcctatg aacgttatat ccgtgtggta 
481 catgccagag tgatcaactt ttcctgggcc tggagggcca ttacctatat ctggctctac 
541 tccttggcat gggcaggagc acctctcctg ggctggaaca ggtacatcct agacatacat 
601 ggactgggct gtaccgtgga ctggagatcc aaggatgcca acgactcttc ctttgtgctc
661 ttcctgtttc tcggctgcct ggtggtgccc gtgggcatca tagcccattg ctacggccac 
721 attctctatt ctgttcgaat gcttcgctgt gttgaagatc ttcagaccat tcaagtgatc 
781 aagatgctaa gatatgaaaa gaaagtagca aagatgtgct ttttgatggc ctttgtcttt 
841 ctcacctgct ggatgcctta cattgtgacc cgcttcttgg tggtcaatgg ctatggacac 
901 ctggtcaccc caactgtgtc tattgtttct tatctctttg ctaaatcgag cactgtgtac
961 aacccagtta tctacatctt catgaacaga aagtttcgga ggtcccttct gcagctccta 
1021 tgcttccgcc tgctgagatg ccagcggcct gctaaaaacc tcccagcggc tgagagtgaa 
1081 atgcacatca ggcccatcgt gatgtcacag aaagatgggg acaggccaaa gaagaaagtg 
1141 acctttaact cttcctctat catctttatc atcaccagtg atgagtccct gtcagtcgag 
1201 gacagtgaca gaagcagcgc atctaaggtc gatgtcatcc aagtgcgtcc tctataagaa
1261 tggaagacag agtcctatat ccagccacca caatgttcct gtcaggagtg ccccagagtc 
1321 cccatttggt gtaatagtga cagaacctct gtggccctgt gggaaatccg aatcacccac 
1381 atgttgtctg tattcaagaa gcgactgagc aagacaaatt attttaactc aatgggtgct
1441 ttataacttg agaacctcct ggtggcacaa gatgagcatc tggcgtcatc gctactatgc 
1501 caggatttaa tatttgaatg actgtatttg ccaaagcaca taatacattt tgtttatcaa
1561 atgtatttca ctataaaaat aacaatctca tatacacgtg tacaatgact ggaacatctg 
1621 gagtacgttg cggtgttgag tcctgttctg tgccatgttg tgtgtttgac caattaaatt 
1681 tattgaagag ataataaatc aatccctact tttatctgaa aaaaaaaaaa aaaaaaa 



SEQ ID NO: 209

Amino acid sequence of mouse OPN3 encoded by the DNA sequence shown in SEQ ID NO: 
208. 

MYSGNRSGDQGYWEDGAGAEGAAPAGTRSPAPLFSPTAYERLALLLGCLALLGVGGNLLV
LLLYSKFPRLRTPTHLFLVNLSLGDLLVSLFGVTFTFASCLRNGWVWDAVGCAWDGFSGS
LFGFVSITTLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWAGAPLLGWNRYILD
IHGLGCTVDWRSKDANDSSFVLFLFLGCLVVPVGIIAHCYGHILYSVRMLRCVEDLQTIQ
VIKMLRYEKKVAKMCFLMAFVFLTCWMPYIVTRFLVVNGYGHLVTPTVSIVSYLFAKSST
VYNPVIYIFMNRKFRRSLLQLLCFRLLRCQRPAKNLPAAESEMHIRPIVMSQKDGDRPKK
KVTFNSSSIIFIITSDESLSVEDSDRSSASKVDVIQVRPL

SEQ ID NO: 210

ENSRNOT00000005072 cDNA sequence, EnsEMBL transcript [Rattus norvegicus] 

1 atgtactcgg ggaaccgtag cggcggccag ggctactggg aggacggggc gggcgccgag 
61 ggcgcagcac cggccggcac gcggagcccc gcgcctctct tcagccccac cgcgtacgag
121 cgcctggcgc tgctgctcgg ttgcctcgcg ctgctgggcg tcggcggcaa cctgctggtg
181 ctgcttctct actccaagtt cccgcgactg cgcacgccca cccacctctt cctggtcaac 
241 ctcagtctgg gcgatctgct ggtatccctg ttcggagtca ccttcacctt cgcgtcgtgc 
301 ctgcggaacg gctgggtgtg ggacgccgtg ggctgcgcgt gggacgggtt tagcggcagc 
361 ctctttgggt ttgtttccat taccaccctc actgtgctgg cttatgaacg ttatatccgt
421 gtggtacatg ccagagtgat caatttttcc tgggcctgga gggccattac ctatatctgg 
481 ctctactctt tggcatgggc aggagcgcct ctcctgggct ggaacaggta catcctcgat 
541 gtacatggac tgggctgtac tgtggactgg aaatccaagg atgccaacga ctcctccttt 
601 gtgctcttcc tgtttcttgg ctgtctggtg gtgcccatgg gcatcatagc ccattgctac 
661 ggtcatattc tgtattctgt tcgaatgctt cgctgtgttg aagatcttca gacaattcaa
721 gtgatcaaga tgttgcgata cgaaaagaaa gttgccaaga tgtgcttttt gatggccttt 
781 gtctttctca cctgctggat gccttatgtt gtaacccgct tcttggtggt caatggctat 
841 ggacacctgg tcaccccaac tgtgtctatc gtttcttatc tctttgctaa atcgagcact 
901 gtgtacaacc cagttatcta catcttcatg atcagaaagt ttcggaggtc ccttctgcaa 
961 ctcctctgtt tccgcctgct gagatgccag aggcctgcta aaaacctccc agcagctgag
1021 agtgaaatgc agatcaggcc cattgtgatg tcacagaaag atggggacag gccaaagaao 
1081 aaagtgacat ttaactcttc ctccatcatc ttcatcatca ccagtgatga gtccctgtca 
1141 gtcgaggaca gcgacagaag cagcgcatct aaggtcgatg taatccaagt gcgtccttta 
1201 tga 

SEQ ID NO: 211

Amino acid sequence of rat OPN3 encoded by the DNA sequence shown in SEQ ID NO: 210. 

MYSGNRSGGQGYWEDGAGAEGAAPAGTRSPAPLFSPTAYERLALLLGCLALLGVGGNLLV
LLLYSKFPRLRTPTHLFLVNLSLGDLLVSLFGVTFTFASCLRNGWVWDAVGCAWDGFSGS
LFGFVSITTLTVLAYERYIRVVHARVINFSWAWRAITYIWLYSLAWAGAPLLGWNRYILD
VHGLGCTVDWKSKDANDSSFVLFLFLGCLVVPMGIIAHCYGHILYSVRMLRCVEDLQTIQ
VIKMLRYEKKVAKMCFLMAFVFLTCWMPYVVTRFLVVNGYGHLVTPTVSIVSYLFAKSST
VYNPVIYIFMIRKFRRSLLQLLCFRLLRCQRPAKNLPAAESEMQIRPIVMSQKDGDRPKK
KVTFNSSSIIFIITSDESLSVEDSDRSSASKVDVIQVRPL

SEQ ID NO: 212 

gi|18860860|ref|NM_006504.2| Homo sapiens protein tyrosine phosphatase, receptor type, E
(PTPRE), transcript variant 1, mRNA

1 agccggagct ggagccgagg cggcggcggg acgcggccgg ccggacaaat ttcctgctag
61 gctgcggacg agcgggcggc aggagccggc gcgagcggct tcaggaaccc acggcctctg 
121 cgcgtccccg cgacccttct tcgcgcccgg cgaagacagc cgggcgcccc ggagggcggc 
181 gggcaggcgc ccgggagatg cggagcctcc gctgcagcgc gatctgcgcg accagaccgg
241 cccccccgag actatagcct tcactttccc tcggtccacc atggagccct tgtgtccact 
301 cctgctggtg ggttttagct tgccgctcgc cagggctctc aggggcaacg agaccactgc 
361 cgacagcaac gagacaacca cgacctcagg ccctccggac ccgggcgcct cccagccgct 
421 gctggcctgg ctgctactgc cgctgctgct cctcctcctc gtgctccttc tcgccgccta 
481 cttcttcagg ttcaggaagc agaggaaagc tgtggtcagc accagcgaca agaagatgcc
541 caacggaatc ttggaggagc aagagcagca aagggtgatg ctgctcagca ggtcaccctc
601 agggcccaag aagtattttc ccatccccgt ggagcacctg gaggaggaga tccgtatcag 
661 atccgccgac gactgcaagc agtttcggga ggagttcaac tcattgccat ctggacacat 
721 acaaggaact tttgaactgg caaataaaga agaaaacaga gaaaaaaaca gatatcccaa 
781 catccttccc aatgaccatt ctagggtgat tctgagccaa ctggatggaa ttccctgttc
841 agactacatc aatgcttcct acatagatgg ttacaaagag aagaataaat tcatagcagc 
901 tcaaggtccc aaacaggaaa cggttaacga cttctggaga atggtctggg agcaaaagtc 
961 tgcgaccatc gtcatgttaa caaacttgaa agaaaggaaa gaggaaaagt gccatcagta 
1021 ctggcccgac caaggctgct ggacctatgg aaacatccgg gtgtgcgtgg aggactgcgt 
1081 ggttttggtc gactacacca tccggaagtt ctgcatacag ccacagctcc ccgacggctg
1141 caaagccccc aggctggtct cacagctgca cttcaccagc tggcccgact tcggagtgcc 
1201 ttttaccccc attgggatgc tgaagttcct caagaaagta aagacgctca accccgtgca 
1261 cgctgggccc atcgtggtcc actgtagcgc gggcgtgggc cggacgggca ccttcattgt
1321 gatcgatgcc atgatggcca tgatgcacgc ggagcagaag gtggatgtgt ttgaatttgt
1381 gtctcgaatc cgtaatcagc gccctcagat ggttcaaacg gatatgcagt acacgttcat 
1441 ctaccaagcc ttactcgagt actacctcta cggggacaca gagctggacg tgtcctccct 
1501 ggagaagcac ctgcagacca tgcacggcac caccacccac ttcgacaaga tcgggctgga 
1561 ggaggagttc aggaaattga caaatgtccg gatcatgaag gagaacatga ggacgggcaa
1621 cttgccggca aacatgaaga aggccagggt catccagatc atcccgtatg acttcaaccg 
1681 agtgatcctt tccatgaaaa ggggtcaaga atacacagac tacatcaacg catccttcat 
1741 agacggctac cgacagaagg actatttcat cgccacccag gggccactgg cacacacggt 
1801 tgaggacttc tggaggatga tctgggaatg gaaatcccac actatcgtga tgctgacgga 

1861 ggtgcaggag agagagcagg ataaatgcta ccagtattgg ccaaccgagg gctcagttac
1921 tcatggagaa ataacgattg agataaagaa tgataccctt tcagaagcca tcagtatacg 
1981 agactttctg gtcactctca atcagcccca ggcccgccag gaggagcagg tccgagtagt 
2041 gcgccagttt cacttccacg gctggcctga gatcgggatt cccgccgagg gcaaaggcat 
2101 gattgacctc atcgcagccg tgcagaagca gcagcagcag acaggcaacc accccatcac 

2161 cgtgcactgc agtgccggag ctgggcgaac aggtacattc atagccctca gcaacatttt
2221 ggagcgagta aaagccgagg gacttttaga tgtatttcaa gctgtgaaga gtttacgact 
2281 tcagagacca catatggtgc aaaccctgga acagtatgaa ttctgctaca aagtggtaca 
2341 agattttatt gatatatttt ctgattatgc taatttcaaa tgaagattcc tgccttaaaa 
2401 tattttttaa tttaatggtc agtatatttt gtaaaaatca tgttaattta tttcatagtt 

2461 gacattaata tcttccctaa tttctttgta tatattttgt tatgccttaa aggccacctg
2521 ctatacagtt gttaaatctt aaatatgctt tttaaaaatt ggaataatgt attaaggtca 
2581 aataatatcc cataaaatat atatttctgc taatattagt aaatatctta atttttaaaa 
2641 aaaaaaaaaa aaaa 

SEQ ID NO: 213

Amino acid sequence of human PTPRE encoded by the DNA sequence shown in SEQ ID
NO: 212.

MEPLCPLLLVGFSLPLARALRGNETTADSNETTTTSGPPDPGASQPLLAWLLLPLLLLLL
VLLLAAYFFRFRKQRKAVVSTSDKKMPNGILEEQEQQRVMLLSRSPSGPKKYFPIPVEHL
EEEIRIRSADDCKQFREEFNSLPSGHIQGTFELANKEENREKNRYPNILPNDHSRVILSQ
LDGIPCSDYINASYIDGYKEKNKFIAAQGPKQETVNDFWRMVWEQKSATIVMLTNLKERK
EEKCHQYWPDQGCWTYGNIRVCVEDCVVLVDYTIRKFCIQPQLPDGCKAPRLVSQLHFTS
WPDFGVPFTPIGMLKFLKKVKTLNPVHAGPIVVHCSAGVGRTGTFIVIDAMMAMMHAEQK
VDVFEFVSRIRNQRPQMVQTDMQYTFIYQALLEYYLYGDTELDVSSLEKHLQTMHGTTTH
FDKIGLEEEFRKLTNVRIMKENMRTGNLPANMKKARVIQIIPYDFNRVILSMKRGQEYTD
YINASFIDGYRQKDYFIATQGPLAHTVEDFWRMIWEWKSHTIVMLTEVQEREQDKCYQYW
PTEGSVTHGEITIEIKNDTLSEAISIRDFLVTLNQPQARQEEQVRVVRQFHFHGWPEIGI
PAEGKGMIDLIAAVQKQQQQTGNHPITVHCSAGAGRTGTFIALSNILERVKAEGLLDVFQ
AVKSLRLQRPHMVQTLEQYEFCYKVVQDFIDIFSDYANFK

SEQ ID NO: 214 

gi|40805848|ref|NM_130435.2| Homo sapiens protein tyrosine phosphatase, receptor type, E
(PTPRE), transcript variant 2, mRNA

1 gtgcagcaga gggcagctga gaggctgggt ggctgggcct gggagacaca cagaggccag 
61 gccttagcgc ggctcagcca tgagcaacag gagtagcttt tcccggctca cctggttcag 
121 gaagcagagg aaagctgtgg tcagcaccag cgacaagaag atgcccaacg gaatcttgga 

181 ggagcaagag cagcaaaggg tgatgctgct cagcaggtca ccctcagggc ccaagaagta
241 ttttcccatc cccgtggagc acctggagga ggagatccgt atcagatccg ccgacgactg 
301 caagcagttt cgggaggagt tcaactcatt gccatctgga cacatacaag gaactttcga 
361 actggcaaat aaagaagaaa acagagaaaa aaacagatat cccaacatcc ttcccaatga 
421 ccattctagg gtgattctga gccaactgga tggaattccc tgttcagact acatcaatgc 

481 ttcctacata gatggttaca aagagaagaa taaattcata gcagctcaag gtcccaaaca
541 ggaaacggtt aacgacttct ggagaatggt ctgggagcaa aagtctgcga ccatcgtcat 
601 gttaacaaac ttgaaagaaa ggaaagagga aaagtgccat cagtactggc ccgaccaagg
661 ctgctggacc tatggaaaca tccgggtgtg cgtggaggac tgcgtggttt tggtcgacta
721 caccatccgg aagttctgca tacagccaca gctccccgac ggctgcaaag cccccaggct 
781 ggtctcacag ctgcacttca ccagctggcc cgacttcgga gtgcctttta cccccattgg 
841 gatgctgaag ttcctcaaga aagtaaagac gctcaacccc gtgcacgctg ggcccatcgt 
901 ggtccactgt agcgcgggcg tgggccggac gggcaccttc attgtgatcg atgccatgat 
961 ggccatgatg cacgcggagc agaaggtgga tgtgtttgaa tttgtgtctc gaatccgtaa 
1021 tcagcgccct cagatggttc aaacggatat gcagtacacg ttcatctacc aagccttact 
1081 cgagtactac ctctacgggg acacagagct ggacgtgtcc tccctggaga agcacctgca 
1141 gaccatgcac ggcaccacca cccacttcga caagatcggg ctggaggagg agttcaggaa 
1201 attgacaaat gtccggatca tgaaggagaa catgaggacg ggcaacttgc cggcaaacat 
1261 gaagaaggcc agggtcatcc agatcatccc gtatgacttc aaccgagtga tcctttccat 
1321 gaaaaggggt caagaataca cagactacat caacgcatcc ttcatagacg gctaccgaca 
1381 gaaggactat ttcatcgcca cccaggggcc actggcacac acggttgagg acttctggag 
1441 gatgatctgg gaatggaaat cccacactat cgtgatgctg acggaggtgc aggagagaga 
1501 gcaggataaa tgctaccagt attggccaac cgagggctca gttactcatg gagaaataac 
1561 gattgagata aagaatgata ccctttcaga agccatcagt atacgagact ttctggtcac 
1621 tctcaatcag ccccaggccc gccaggagga gcaggtccga gtagtgcgcc agtttcactt 
1681 ccacggctgg cctgagatcg ggattcccgc cgagggcaaa ggcatgattg acctcatcgc 
1741 agccgtgcag aagcagcagc agcagacagg caaccacccc atcaccgtgc actgcagtgc 
1801 cggagctggg cgaacaggta cattcatagc cctcagcaac attttggagc gagtaaaagc 
1861 cgagggactt ttagatgtat ttcaagctgt gaagagttta cgacttcaga gaccacatat 
1921 ggtgcaaacc ctggaacagt atgaattctg ctacaaagtg gtacaagatt ttattgatat 
1981 attttctgat tatgctaatt tcaaatgaag attcctgcct taaaatattt tttaatttaa 
2041 tggtcagtat attttgtaaa aatcatgtta atttatttca tagttgacat taatatcttc 
2101 cctaatttct ttgtatatat tttgttatgc cttaaaggcc acctgctata cagttgttaa
2161 atcttaaata tgctttttaa aaattggaat aatgtattaa ggtcaaataa tatcccataa 
2221 aatatatatt tctgctaata ttagtaaata tcttaatttt tcattagatt catatcattt 
2281 aatttcacat attcaacacc tttaaatgtt gtaatcttaa tatgcgaagt gtgcctctgc 
2341 aagatactaa cacaaagctc atgttaagaa aacagttgag gactcagaag tcagttgaaa 
2401 atgcactttc ctaacagtga attcacaacc ctgaacagca gcatttttgg aaggcaaact
2461 gttcgtgatg gtacaatgta aatggggact tctgtaaagt tctcagtttc ggtccatgtg 
2521 gtttatcttt acattttaaa gatcaaagaa gtctttacaa cctgaatcca ggtctaaaac 
2581 acactagagt agctggtgac tataaataat attttaaaat gctgtgtcta caccatcaag 
2641 actgtgtcta cactatcttg gctgaacgag aagagatgta aatgctgggt ggtcccgttg 
2701 acccacggcg ttgggtacaa caaaaccagc catcggagtt acaccccaaa gcaccatttg
2761 ctgtccagct gcctgtcgtt tggcccagac caccctcaga aaaaaaccag ctgcctctcc 
2821 cattctcccc tcccgttctg ccacagcggc ctgggctggt ccagtgctat gcctggaggc 
2881 tcaacacaaa acttcccatc caaacattca gatgaactga gcgtcttaca cacgcagtac 
2941 agaggagcac acattaggat agaaacagta gaataaccac gggcaattaa actttaaatt 
3001 ttctgagcag cattttggta tttaaacatt tcttgtaaaa agctgagaca gtttgtaaga 
3061 aaagaatcct taaaatctag atttatacca ttttttaaag tcccaccttt caatgtttaa 
3121 taaaacaaaa agagaaatcc ttaatctaaa agctaaatta tttttgaatg gaaatactac 
3181 tgagaccatt gacactggat aacagtaatg atcccattac cagataagat tgactgacgg 
3241 ggaaaaaaaa aaaaaaagaa tggggtgtga atgtaccaac actgaatctt acagcagtta 
3301 tctttctatg gccattaggt acctagcaga tgtgcacaat ataaacaaaa agatatctgg
3361 cctaccttac tactaaaagc atttaacacg tgcatttttg gtactttttt ttttgttttc 
3421 taaaagctac ataaaggcct tatttgacat tttcactgat aactgatcgc accctcatgt 
3481 tgcagtgttc gtcccctatt cattaacagt gtgtgcaaat gcaaccccag gacacattat 
3541 tgttctgata gatgctcaca gggaatcaag cttctcgccc actccacggc tctgagcccc 
3601 atccaagggc aagacttggt gcccagctgg aaggacgaaa gcacactttg tgaccgccat 
3661 cctcaccagc tgcatgcctg ggctgcacac tgctgaacgt gctcctctct ccttctctgt 
3721 acaatgattc agcatctcgg cggaagagga aaatggagct ttttgaggct cgccaggtcc 
3781 cttttgtttt caccattaaa attccaaacc caaagccttt gtttgactga aggagaagag 
3841 agggaagtaa gcttctgttc agcacctgaa ccctagaaaa agagccagtt tgctacgatg 
3901 aaggtgacat ttctctggtc atttatttga gagttcgaag tcaaagtcga ggggcaccgg 
3961 ctttggttca tgtctaggag ccctctgtca gaatccttga agccctttaa tggtctaact 
4021 ggcatctctt gtatcaagaa gtacctttaa ggtagacctt ttcagggtgc cctcaggaaa 
4081 gggccctgct catgtttttt tcctgtcccc ttagaccaac cccaggtgtc cactgcaggg 
4141 gttctgcctg ttcccaaact ttttccattc caggaacaaa ggagaagcca ctttccccag 
4201 gacgcaagac tctcccctcc actgtccggg acagcgttcg ccctttagcg gggaggtcat 
4261 tacagcctca tggcctctac caaggcccca gatcacagga tctcctgggc cttggagcac 
4321 ctcacgctgg gggaatcaat ccctgaggga ctcagaatct tctccgtgca acctggaaag 
4381 ttcatctctt gtttccttca gtcaaagaaa gtccattgta cataacaaaa cagcccccaa 
4441 acagcccagt gccgacacca ttgttccttt cacactttcc tttgttgcat gcagttgggt 
4501 tcaaatgcca aatagtgatt agaagacgac cattctgatc tgtgtgtgat ctggtcacta 
5 4561 tgtgactgcc tttacggttt ctctccatgt gctatatgaa tgaagaatgc ataccagtgt 
4621 tttaaaaggt atttttatgt gtttttaaac acttttttaa atgagcctga cacctgtgtt 
4681 tcagcatttg gagacatccc catgttattc ttttaagtgt ataattactg atactttttt 
4741 gtttgtttgt ttaactaagt tgtgtttaac ttatgtgcag tctttataat gtatgtatgt 
4801 tattacagtt tcaactatca tattttcttt gattacattt ataatttgat cttgctctga 
10 4861 ttataatgcc agtgaatgtt gctgaactct ttgtatatgc aaattgcaag atttaaacca 
4921 ttctgatgca aggataaacc tttactttga ctaccagcct gtgtttttgt ctttaaatct 
4981 cttaatttca ttcctctgca aa 

SEQ ID NO: 215 

Amino acid sequence of human PTPRE variant ORF number 1 encoded by the DNA 
sequence shown in SEQ ID NO: 214. 

MSNRSSFSRLTWFRKQRKAWSTSDKKMPNGILBEQEQQRVMLLSRSPSGPKKYFPIPVE 
HLEEEIRIRSADDCKQFREEFNSLPSGHIQGTFELANKEENRBKNRYPNILPNDHSRVIL 
SQLDGIPCSDYINASYIDGYKEKNKFIAAQGPKQETVNDFWRMWEQKSATIVMLTNL^ 
RKEEKCHQYWPDQGCWTYGNIRVCVBDCVVLVDYTIRKFCIQPQLPDGCKAPRLVSQLHF 

20 TSWPDFGVPFTPIGMLKFLKKWCTLNPVH^^ 

QKVDVFEFVSR IRNQR PQMVQTDMQYTF I YQALLEYYLYGDTELDVS SLEKHLQTMHGTT 
THFDKIGLEEEFRKLTNVRIMKENMRTGNLPANMKKARVIQI IPYDFNRVILSMKRGQEY 
TDYINASFIDGYRQKDYFIATQGPLAHTVEDFWRMIWEWKSHTIVMLTEVQEREQDKCYQ 
YWPTEGSVTHGEITIEIKNDTLSBAISIRDFLVTIiNQPQAROEEQVRVVRQFHFHGWPEI 

25 GI PAEGKGNIDL I AAVQKQQQQTGNHP ITVHCS AGAGRTGTF I ALSN I LER VKAEGLLD V 
FQAVKSLRLQRPHMVQTLEQYEFCYKVVQDFIDIFSDYANFK 

SEQ ID NO: 216 

gi|19171238|emb|AJ430580.1|HSA430580 Homo sapiens mRNA for tyrosine phosphatase 
epsilon PD1 (PTPRE gene) 

30 1 ccttagcgcg gctcagccat gagcaacagg agtagctttt cccggctcac ctggttcagg 

61 aagcagagga aagctgtggt cagcaccagc gacaagaaga tgcccaacgg aatcttggag 
121 gagcaagagc agcaaagggt gatgctgctc agcaggtcac cctcagggcc caagaagtat 
181 tttcccatcc ccgtggagca cctggaggag gagatccgta tcagatccgc cgacgactgc 
241 aagcagtttc gggaggagtt caactcattg ccatctggac acatacaagg aacttttgaa 
35 301 ctggcaaata aagaagaaaa cagagaaaaa aacagatatc ccaacatcct tcccaatgac 
361 cattctaggg tgattctgag ccaactggat ggaattccct gttcagacta catcaatgct 
421 tcctacatag atggttacaa agagaagaat aaattcatag cagctcaagg tcccaaacag 
481 gaaacggtta acgacttctg gagaatggtc tgggagcaaa agtctgcgac catcgtcatg 
541 ttaacaaact tgaaagaaag gaaagaggaa aagtgccatc agtactggcc cgaccaaggc 
40 601 tgctggacct atggaaacat ccgggtgtgc gtggaggact gcgtggtttt ggtcgactac 
661 accatccgga agttctgcat acagccacag ctccccgacg gctgcaaagc ccccaggctg 
721 gtctcacagc tgcacttcac cagctggccc gacttcggag tgccttttac ccccattggg 
781 atgctgaagt tcctcaagaa agtaaagacg ctcaaccccg tgcacgctgg gcccatcgtg 
841 gtccactgta gcgcgggcgt gggccggacg ggcaccttca ttgtgatcga tgccatgatg 
45 901 gccatgatgc acgcggagca gaaggtggat gtgtttgaat ttgtgtctcg aatccgtaat 
961 cagcgccctc agatggttca aacggatatg cagtacacgt tcatctacca agccttactc 
1021 gagtactacc tctacgggga cacagagctg gacgtgtcct ccctggagaa gcacctgcag 
1081 accatgcacg gcaccaccac ccacttcgac aagatcgggc tggaggagga gttcaggaaa 
1141 ttgacaaatg tccggatcat gaaggagaac atgaggacgg gcaacttgcc ggcaaacatg 
50 1201 aagaaggcca gggtcatcca gatcatcccg tgtaaggcac ccgtggcgtg gcttgggcag 
1261 ggctggggct gggctggtgc cggaggcttt catcctggag aagccattga ccgcttaccc 
1321 ctgtgcacca gggtcaaaag cagggtgtcc tctggcctca ggctaagggc tcctgtgtgc 
1381 tgtaccaaat gtcaggtaga aaagacacct gtgaagcacc agctgaccca gacagtcctg 
1441 catgggtctc aggcaagctg cttgggatat gagcttgcag atgcgggagg gacgccgagg 
1501 ctgatttcag gggagggctt cacagaagct ggagatttgc ggcaggcacg acttgggatt 
1561 ccttgtctca ttccttgtct ccccaagcct gggcagaggc ccgcctcccc cttacacaga 
5 1621 ggggcctgag ccacccagcc gaggccatgc aggtgaactc agctgagacc acctaggaaa 
1681 aggctgtggc cttgaggagg acaagaaatg ggcctagggt agagtcagga cactggtctg 
1741 ccacccaggt catgggagag ggaaggagga cctggcaggt gtgggaggag tggaggcaaa 
1801 gctggcaaga agacagggcc acatgcagtg ggcaggggag tgggaatcgg gaggctcacc 
1861 aagtgggaaa aagagaaact tgagaaaccc ggccccaaag cagactccac agggatccga 

10 1921 ggaactaggg agggaggtgg gagatgcccc gcttgggagg agggtgtgcc tcgttataga 
1981 cttgtcctcg acagagtcac agctcaacaa acaccatctg gaaacagctg ggagcaggcc 
2041 tgctgcagca gggtctcccc gagcactgca gatgacttca accgagtgat cctttccatg 
2101 aaaaggggtc aagaatacac agactacatc aacgcatcct tcatagacgg ctaccgacag 
2161 aaggactatt tcatcgccac ccaggggcca ctggcacaca cggttgagga cttctggagg 

15 2221 atgatctggg aatggaagtc ccacactatc gtgatgctga cggaggtgca ggagagagag 
2281 caggataaat gctaccagta ttggccaacc gagggctcag ttactcatgg agaaataacg 
2341 attgagataa agaatgatac cctttcagaa gccatcagta tacgagactt tctggtcact 
2401 ctcaatcagc cccaggcccg ccaggaggag caggtccgag tagtgcgcca gtttcacttc 
2461 cacggctggc ctgagatcgg gattcccgcc gagggcaaag gcatgattga cctcatcgca 

20 2521 gccgtgcaga agcagcagca gcagacaggc aaccacccca tcaccgtgca ctgcagtgcc 
2581 ggagctgggc gaacaggtac attcatagcc ctcagcaaca ttttggagcg agtaaaagcc 
2641 gagggacttt tagatgtatt tcaagctgtg aagagtttac gacttcagag accacatatg 
2701 gtgcaaaccc tggaacagta tgaattctgc tacaaagtgg tacaagattt tattgatata 
2761 ttttctgatt atgctaattt caaatgaaga ttcctgcctt aaaatatttt ttaatttaat 

25 2821 ggtcagtata ttttgtaaaa atcatgttaa tttatttcat agttgacatt aatatcttcc 
2881 ctaatttctt tgtatatatt ttgttatgcc ttaaaggcca cctgctatac agttgttaaa 
2941 tcttaaatat gctttttaaa aattggaata atgtattaag gtcaaataat atcccataaa 
3001 atatatattt ctgctaataa aaaaaaactt t 

SEQ ID NO: 217 

Amino acid sequence of human PTPRE variant ORF number 2 encoded by the DNA 
sequence shown in SEQ ID NO: 216. 

MSNRSSFSIUiTWFRKQRKAWSTSDKKMPNQILEEQEQQRVMLLSRSPSGPKKYFPIPVE 
HLEEEIR IRS ADDCKQ FREE FNSLPSGHIQGTFELANKEENREKNRYPNILPNDHSRVIL 
SQLDGIPCSDYINASYIDGYKEKNKFIAAQGPKQETV^FWRMWEQKSATIVMIiTNLKE 
35 RKEEKCHQYWPDQGCTTYGNIRVC^DCWLVDYTIRKFCIQPQLPDGCKAPRliVSQLHF 
TSWPDFGVPFTPIGMLKFLKKVTCTLNPVHAGPIVVHCSAGVGRTO 

QKVD VFEF VSR I RNQRPQMVQTDMQYT F I YQALLE YYL YGDTELDVS S LEKHLQTMHGTT 
THFDKIGLEEEFRKLTNVRIMKENMRTGNLPANMKKARV1QI I PCKAPVAWLGQGWGWAG 
AGGFHPGEAIDRLPLCTRVKSRVSSGLRLRAPVCCTKCQVEKTPVKHQLTQTVLHGSQAS 
40 CLG YELADAGGTPRL I S GEGFTEAGDLRQARLGI PCLI PCLPKPGQRPAS PLHRGA 
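
The relationship stated above between a nucleotide listing and its encoded amino acid sequence can be checked with a short script. The following is only a minimal sketch, not part of the original listing. It assumes the standard genetic code and that the open reading frame begins at the first ATG in the sequence, which happens to hold for the first line of SEQ ID NO: 216 but not for every sequence in this listing. The function name `translate_first_orf` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_first_orf(listing_text):
    """Translate from the first ATG to the first stop codon (hypothetical helper).

    Position numbers and spaces in the listing are discarded; any codon that
    is not in the table (for example one containing 'n') is reported as 'X'.
    """
    seq = "".join(ch for ch in listing_text.upper() if ch in "ACGTN")
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":  # stop codon ends the reading frame
            break
        protein.append(residue)
    return "".join(protein)


# First line of SEQ ID NO: 216 as given in this listing.
line_1 = "1 ccttagcgcg gctcagccat gagcaacagg agtagctttt cccggctcac ctggttcagg"
print(translate_first_orf(line_1))
```

Run on that first line, the sketch yields the residues that open SEQ ID NO: 217. Where a sequence has an upstream ATG outside the coding frame, the start position would have to be supplied explicitly rather than searched for.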

SEQ ID NO: 218 

gi|1199932|dbj|D83484.1|MUSPTPE Mouse mRNA for protein tyrosine phosphatase epsilon, 
complete cds 

1 cacgggtccc agccctgctt taccagttgc gaaaccttaa gtgccacact actttgtggt 

45 61 gtncagagct aaggctaaat acgaagggac gttgagtttt ctctatggct cgtctgagtg 

121 ccatcacctt ttaggatcct ccctgataga gtgtcagtca gagtgcagga gagaggcatg 

181 ttcctgcatc ctcccagtgc aaagccaaag ctccagaggg tgagcgccgc gtgcgctccc 

241 agagccaggc accaggcgtc gcqgtcgccc ctcgcagctc cacggccccg ccccnnccgc 

301 ggagctggag ctggagccgg agccggagcc ctagcccaga gctggaggcg gccggaccgg 

50 361 gccagacaga tttcctgctc tcctgtaggg ctgagggctg ccggctgcgg gctacgggct 

421 gcgggctaca ggctacgggc tacggactgc gggctgcgag agaagtcacc ggcggcgcag 

481 gtacctcact gcctacgcgc gtccccacga ccctccctcg cgccggcggg gacagcggac 
541 gccccggagc ggagcgcgtc gggcgggcgc ccgggagatg cggagtcgcc gcggcggagc 
601 gatcggggct acagcaccgg tccctgggag actatagccc tcatttcccc ttggtgcacc 
661 atggagccct tctgtccact cctgctggca agttttagct tgtcgctcgc cagagctggc 
721 cagggcaacg acaccacccc aacagagagc aactggacca gcacaactgc aggccctccg 
5 781 gaccctggtg catcccagcc gctgctcacc tggctgctgc tgcccctgct cctcctcctg 
841 ttcctgcttg cagcctactt cttcaggttc cggaagcaga ggaaggccgt ggtcagcagc 
901 aacgacaaga aaatgcctaa cgggatctta gaagagcaag agcagcagag agtgatgctg 
961 ctgagcagat ctccatcagg ccccaagaag ttcttcccca tccccgtgga gcacctggag 
1021 gaggagatcc gggtgagatc tgcggatgac tgcaagcgat tccgagagga gttcaattca 
10 1081 ttgccatctg gacacataca aggaaccttt gaactagcaa ataaagaaga aaacagagaa 
1141 aaaaacagat accccaacat tctgcccaat gatcattgca gagtgatttt gagccaagtg 
1201 gatggaatcc cctgctctga ctacattaat gcttcctaca tcgatggcta caaagaaaag 
1261 aacaaattca tagcagctca aggccctaag caggagacag tgaatgactt ctggagaatg 
1321 gtctgggagc aaaggtcagc caccatcgtc atgttgacga acctgaagga gaggaaggag 
1381 gagaagtgct accagtactg gccagaccag ggctgttgga cctacggcaa catccgggtg 
1441 tgtgtagagg actgcgtggt cctggtggat tacacgatcc gaaagttctg catccatccg 
1501 caactcccag acagctgcaa agccccgcgg ctggtctcac agctgcactt caccagctgg 
1561 cctgacttcg gggtgccgtt tacccccatc gggatgctca agttcctgaa gaaagtgaag 
1621 acactcaacc cctcacatgc tgggcccatt gtggttcact gtagcgcggg cgtgggtcgg 
1681 actggcacct tcattgtgat cgatgccatg atggacatga tacactcgga gcagaaggtt 
1741 gacgtctttg agtttgtgtc tagaatccgc aatcagcgcc ctcagatggt ccagacggat 
1801 gttcagtata cattcatcta ccaagcctta ctggaatact acctctatgg ggacacagag 
1861 ctggatgtgt cctccctgga gaggcacctg cagacgctcc atagcacagc cacccatttt 
1921 gacaagatcg ggctggagga agagttcagg aagctgacca acgtgcgaat catgaaggag 
1981 aacatgagga cgggcaacct gcctgccaac atgaagaagg cccgcgtcat ccagatcatt 
2041 ccatatgact tcaatcgggt catcctgtcc atgaaaagag ggcaagagtt cacagactat 
2101 atcaacgcat ccttcataga tggctacagg cagaaggact acttcatggc cacacaggcg 
2161 cctctggctc acacagttga ggacttctgg aggatggtat gggagtggaa gtctcacaca 
2221 atgctcatgc tgacggaggt gcaggagcgg gaacaggata aatgctacca gtattggcca 
2281 acggagggct cggtgactca tggagatata actatagaga taaagagcga caccctgtct 
2341 gaagcaatca gcgtacgaga ctttctggtt accttcaaac agcccctggc ccgccaggaa 
2401 gagcaggtcc gcatggtgag acaattccat ttccatggct ggcctgaggt tggcatcccc 
2461 gctgaaggca aaggcatgat tgacctgatt gcagcagtgc agaagcagca gcagcagacg 
2521 ggcaaccacc ccatcaccgt gcactgcagc gcgggagcag ggcggacagg tacattcata 
2581 gcactcagta acattttgga acgagtgaaa gccgagggac tcctagacgt gtttcaagct 
2641 gtgaagagct taagacttca gagaccacac atggtgcaga ccctggagca atatgaattc 
2701 tgctacaaag tggtacaaga ttttatcgat atattttctg attatgctaa tttcaaatga 
2761 agattcctgc cttaaaatat tttttaattt aatggtcagt atattttgta aaaaaaaaaa 
2821 aaaaaaa 

SEQ ID NO: 219 

Amino acid sequence of mouse PTPRE encoded by the DNA sequence shown in SEQ ID 
NO: 218. 

MEPFCPLLLASFSLSLARAGQGNDTTPTESNWTSTTAGPPDP 

FLLAAYFFRFRKQRKAWSSNDKKMPNGILEEQEQQRVMLLSRSPSGPKKFFPIPVEHLE 
45 EEIRVRSADDCKRFREEFNSLPSGHIQGTFELANKEENREKNRYPNILPNDHCRVILSQV 
DOIPCSDYINASYIDGYKEKNKFIAAQGPKQETVNDFWRMVWEQRSATIVMLTNLKERKE 
EKCYQYWPDQGCWTYGNIRVCVEDCNA^VDYTIRKFCIHPQLPDSCKAPRLVSQIjHFTSW 
PDFGVPFTPIGMLKFLKKVKTLNPSHAGPIVVHCSAGVGRTGTFIVIDAMMDMIHSEQKV 
DVFEFVS R I RNQRPQMVQTDVQYTF I YQALLE YYLYGDTELDVS SLERHLQTLHSTATHF 
50 DKIGLEEEFRKLTNVRIMKENMRTGNLPANMKKARVIQIIPYDFNRVILSMKRGQEFTDY 
INASFIDGYRQKDYFMATQAPIAHTVEDFWRMWEW^HTPILMLTEVQEREQDKCYQYWP 
TEGSVTHGDITIEIKSDTLSEAISVRDFXVTFKQPLARQEEQVRMVRQFHFTJGWPEVGIP 
AEGKGMIDLIAAVQKQQQQTGNHPITVHCSAGAGRTGTFIALSNILERVKAEGLLDVFQA 
VKSLRLQRPHMVQTLEQYEFCYKVVQDFIDIFSDYANFK 

SEQ ID NO: 220 

gi|34861150|ref|XM_341950.1| Rattus norvegicus Protein tyrosine phosphatase, receptor type, 
epsilon polypeptide (Ptpre), mRNA 

1 atggaaccct tctgtccact cctgttggca agttttagct tgtcgcttgc cacagctggc 
61 cagggcaacg acaccacccc aacagagagc aactggacca gtacaactgc aggtcctccg 
121 gaccctggca cgtcccagcc gctgcttacc tggctgctgc tgcctctgct cctcctcctg 
181 ttcctccttg cagcctactt cttcaggttc aggaagcaga ggaaagccgt ggtcaacagc 
241 aacgacaaga agatgcctaa cgggatccta gaagaacaag agcagcagag agtgatgcta 
301 ctgagcagat ctccatcggg tcccaagaag tacttcccca tccctgtgga gcacctggag 
361 gaggagatcc gggtgagatc tgcggatgac tgcaagcggt tccgagagga gttcaattca 
421 ttgccatccg gacacataca aggaaccttt gaactagcaa ataaagaaga aaacagagaa 
481 aaaaacagat accccaacat tctgcccaat gatcattgca gagtgatttt gagccaattg 
541 gatggaatcc cctgctctga ctacattaat gcttcctaca tagatggcta caaagaaaag 
601 aacaaattca tagcagctca aggccccaag caggagacag tgaatgactt ctggagaatg 
661 gtctgggagc aaaggtcagc caccattgtc atgttgacga acctgaagga aaggaaggag 
721 gagaagtgct accagtactg gccagaccag ggctgttgga cttatgggaa catccgggtg 
781 tgtgtagagg actgcgtggt cctggtggat tacaccatcc gcaagttctg catccatccg 
841 caactccccg acagctgcaa agctccacgg ctggtctcac agctgcactt caccagctgg 
901 cctgactttg gggtaccatt tacccccatc ggaatgctga agttcctgaa gaaagtgaag 
961 acactcaacc cctcacatgc tgggcccatt gtggtccact gtagcgcggg cgtgggtcgg 
1021 actggcacct tcattgtgat cgatgccatg atggacatga tccactcgga acagaaggtt 
1081 gacgtcttcg agtttgtgtc tagaatccgc aatcagcgcc ctcagatggt ccagacagat 
1141 gttcagtata cattcatcta ccaagcctta ctggaatact acctctatgg ggacacggag 
1201 ctggatgtgt cctccttgga gaggcacctg cagacgctac atggcacagc cacccatttt 
1261 gacaagatcg ggctggagga agagttcagg aaactgacca acgtgcgaat catgaaggag 
1321 aacatgagga cgggcaacct gccggccaac atgaagaagg cccgtgtaat ccagatcatc 
1381 ccatatgact tcaatcgggt aatcctgtcc atgaaaagag ggcaagagtt cacagactac 
1441 atcaatgcat ccttcataga tggctacagg cagaaggact acttcatggc tacacagggg 
1501 cctctggcgc acacagttga ggacttctgg aggatggtct gggagtggaa gtcacacaca 
1561 atcgtcatgc tgactgaggt gcaggagcgg gagcaggata aatgctacca gtattggcca 
1621 accgagggct cagtgactca cggagatata actatagaaa taaagagtga caccctgtct 
1681 gaagcaatca gcatacgaga ctttctggtt actttcaaac agcccctggc ccgccaggaa 
1741 gagcaggtcc ggatggtgag acaattccat ttccatggct ggcctgaggt tggaatcccc 
1801 actgagggta aaggcatgat tgacctgatc gcggcagtgc agaagcagca gcagcagaca 
1861 ggcaaccacc ccatcaccgt gcactgcagt gcgggagcag ggcggacagg tacattcata 
1921 gcactcagta acattttgga acgagtgaaa gccgagggac tcctagatgt gtttcaagct 
1981 gtgaagagct taagacttca gagaccacac atggtgcaaa ccctggagca atatgaattc 
2041 tgctacaaag tggtacaaga ttttatcgat atattttctg attatgctaa tttcaaatga 
2101 agatccctgc cttaaaatat tttttaattt aatggtcagt atattttgta aaaaaaaatc 
2161 atgttaattt atttcctagt ggatattaat atccgtcctg attcctttgt atatattttg 
2221 ttatgttcta aaggctacct gctgtaagat tattaaatca taaatgtcct tttt 



SEQ ID NO: 221 

Amino acid sequence of rat PTPRE encoded by the DNA sequence shown in SEQ ID NO: 
220. 



MBPFCPLI^SFSLSLATAGQGNPTTPTESNWTSTTAGPPDPGTSQPLLTWLIiLPLLLIiL 
FLLAAYPPRFRKQRKAVVNSNDKKMPNGILEEQEQQRVMLLSRSPSGPKKYFPIPVEHLE 
EE1RVRSADDCKRFREEFNSLPSGHIQGTFELANKEENREKNRYPNILPNDHCRVILSQL 
DGIPCSDYINASYIDGYKEKNKFIAAQGPKQETVOT)^^ 

EKCYQYWPDQGCWTYGNIRVCVEDCVVLVDYTIRKFCIHPQLPDSCKAPRLVSQLHFTSW 
PDFGVPFTPIGMLKFLKKVKTliNPSHAGPIVVHCSAGVGRTGTF3^IDAMMDMIHSEQKV 
DVFEFVSRIRNQRPQMVQTDVQYTFIYQALLEYYLYGDTELDVSSLERHLQTLHGTATHF 
DKIGLEEEFRKLTNVRIMKENMRTGNLPANMKKARVIQIIP 

INAS F IDGYRQKD YFMATQGPLAHTVEDF WRMVWEWKSHT I VMLTEVQEREQDKCYQ YWP 
TEGSVTHGDITIEIKSDTLSEAISIRDFLVTFKQPLARQEEQVRMVRQFHFHGWPEVGIP 
TEGKGMIDLIAAVQKQQQQTGNHPITVHCSAGAGRTGTFIALSNILERVKAEGLLDVFQA 
VKSLRLQRPHMVQTLEQYEFCYKVVQDFIDIFSDYANFK 

SEQ ID NO: 222 

gi|31083343|ref|NM_020311.1| Homo sapiens G protein-coupled receptor (RDC1), mRNA 

1 tgcaagtctg cagccagcag agctcacagt tgttgcaaag tgctcagcac taagggagcc 
5 61 agcgcacagc acagccagga aggcgagcga gcccagccag cccagccagc ccagccagcc 
121 cggaggtcat ttgattgccc gcctcagaac gatggatctg catctcttcg actactcaga 
181 gccagggaac ttctcggaca tcagctggcc atgcaacagc agcgactgca tcgtggtgga 
241 cacggtgatg tgtcccaaca tgcccaacaa aagcgtcctg ctctacacgc tctccttcat 
301 ttacattttc atcttcgtca tcggcatgat tgccaactcc gtggtggtct gggtgaatat 
10 361 ccaggccaag accacaggct atgacacgca ctgctacatc ttgaacctgg ccattgccga 
421 cctgtgggtt gtcctcacca tcccagtctg ggtggtcagt ctcgtgcagc acaaccagtg 
481 gcccatgggc gagctcacgt gcaaagtcac acacctcatc ttctccatca acctcttcgg 
541 cagcattttc ttcctcacgt gcatgagcgt ggaccgctac ctctccatca cctacttcac 
601 caacaccccc agcagcagga agaagatggt acgccgtgtc gtctgcatcc tggtgtggct 
15 661 gctggccttc tgcgtgtctc tgcctgacac ctactacctg aagaccgtca cgtctgcgtc 
721 caacaatgag acctactgcc ggtccttcta ccccgagcac agcatcaagg agtggctgat 
781 cggcatggag ctggtctccg ttgtcttggg ctttgccgtt cccttctcca ttatcgctgt 
841 cttctacttc ctgctggcca gagccatctc ggcgtccagt gaccaggaga agcacagcag 
901 ccggaagatc atcttctcct acgtggtggt cttccttgtc tgctggctgc cctaccacgt 
20 961 ggcggtgctg ctggacatct tctccatcct gcactacatc cctttcacct gccggctgga 
1021 gcacgccctc ttcacggccc tgcatgtcac acagtgcctg tcgctggtgc actgctgcgt 
1081 caaccctgtc ctctacagct tcatcaatcg caactacagg tacgagctga tgaaggcctt 
1141 catcttcaag tactcggcca aaacagggct caccaagctc atcgatgcct ccagagtctc 
1201 agagacggag tactctgcct tggagcagag caccaaatga tctgccctgg agaggctctg 
25 1261 ggacgggttt acttgttttt gaacagggtg atgggcccta tggttttcta gagcaaagca 
1321 aagtagcttc gggtcttgat gcttgagtag agtgaagagg ggagcacgtg ccccctgcat 
1381 ccattctctc tttctcttga tgacgcagct gtcatttggc tgtgcgtgct gacagttttg 
1441 caacaggcag agctgtgtcg cacagcagtg ctgtgcgtca gagccagctg aggacaggct 
1501 tgcctggact tctgtaagat aggattttct gtgtttcctg aattttttat atggtgattt 
30 1561 gtatttaaat tttaagactt tattttctca ctattggtgt accttataaa tgtatttgaa 
1621 agttaaatat attttaaata ttgtttggga ggcatagtgc tgacatatat tcagagtgtt 
1681 gtagttttaa ggttagcgtg acttcagttt tgactaagga tgacactaat tgttagctgt 
1741 tttgaaatta tatatatata aatatatata aatatataaa tatatgccag tcttggctga 
1801 aatgttttat ttaccatagt tttatatctg tgtggtgttt tgtaccggca cgggatatgg 
35 1861 aacgaaaact gctttgtaat gcagtttgtg acattaatag tattgtaaag ttacatttta 
1921 aaataaacaa aaaactgttc tggactgcaa atctgcacac acaacgaaca gttgcatttc 
1981 agagagttct ctcaatttgt aagttatttt tttttaataa agatttttgt ttccaaaaaa 
2041 aaaaaaaaaa aaaaaaaaaa aaaa 

SEQ ID NO: 223 

Amino acid sequence of human RDC1 encoded by the DNA sequence shown in SEQ ID NO: 
222. 

MDLHLFDYSBPGNFSDISWPCNSSDCIWDTVMCPNMPNKSVLLYTLSFIYIFIFVIOMI 
ANSVVVWWIQAKTTGYDTHCYILNLAIA^ 

HL I FS INL FGS I FFLTCMS VDRYLS ITYFTNTPSSRKKMVW^ WCILVWLIAFCVSLPDT 
45 YYLKTVTS ASNNETYCRSF YPEHS I KEWLIGMELVS WLGFAVPFS 1 1 AVFYFLLARAIS 
ASSDQEKHSSRKIIFSYV\n^LVCWLPYHVAVLLDIFSILHYIPFTCRLEHALFTALHVT 
QCLSLVHCCVNPVLYSFINRNYRYELMKAFIFKYSAKTGLTKLIDASRVSriTErXSALEQS 
TK 

SEQ ID NO: 224 
gi|31560714|ref|NM_007722.2| Mus musculus chemokine orphan receptor 1 (Cmkor1), 
mRNA 

1 gctcagttgc tacaaactgc tcagcactga aggagcctgc agcgctcacc gtcaggaagg 
61 caaaccacag cccaggaagc cctgaggtca cttggtcgct ctcctcaaga ccatggatgt 
5 121 gcacttgttt gactatgcag agcctggcaa ctactctgac atcaactggc catgtaacag 
181 cagcgactgc attgtggtgg acactgtgca gtgtcccacc atgcctaaca agaacgtgct 
241 tctgtatacc ctctccttca tctacatttt catcttcgtg atcggcatga ttgccaactc 
301 tgtggtggtc tgggtgaata tccaggctaa gaccacaggc tacgacacgc actgctacat 
361 cttgaacctg gccattgcag acctgtgggt cgtcatcacc atccccgtct gggtggtcag 
421 tctcgtgcag cataaccagt ggcccatggg ggagctcaca tgcaagatca cacacctcat 
481 tttctccatc aacctctttg ggagcatctt cttcctcgcc tgcatgagcg tggaccgcta 
541 tctctccatc acctacttca ccggcacctc cagctataag aagaagatgg tacgccgtgt 
601 tgtatgcatc ttggtgtggc tgctggcctt ctttgtgtcc ctgcctgata cctactacct 
661 gaagacggtc acatctgctt ccaacaatga gacctactgc aggtccttct accccgagca 
15 721 cagcatcaag gagtggctga tcggcatgga gctggtctct gtcatcttgg gctttgctgt 
781 ccccttcact atcattgcga tcttctactt cctgctcgct agagccatgt cagcatcagg 
841 cgaccaggag aagcacagta gccggaagat catcttctcc tacgtggtgg tcttcctggt 
901 atgttggctg ccgtaccatt ttgtggttct gctggacatc ttctccatct tacactacat 
961 cccgtttacc tgtcagctgg agaatgtgct ctttacagcg ttgcatgtca cccagtgcct 
20 1021 gtccttggtg cactgctgtg tcaaccccgt gctctacagc ttcatcaacc gcaactacag 
1081 gtacgagctg atgaaggcct tcatcttcaa gtactcggcc aaaacaggtc tcaccaagct 
1141 cattgatgcc tccagagtgt cagagacaga gtactctgcc ctggaacaga acaccaagtg 
1201 atccatcatc ctgcagaggt cggggggaca cgtgcatgtt gcaaatgggg cggctgggtc 
1261 ctgtggtttc ttcaagaaag caatgtagct ttgggtctgg ttgcttgagt ggtatgaaga 
25 1321 ggagaaggca tgtgaccatg cttcctgtct ctcattctcc agccaagatg gctgccgcct 
1381 ggatgcccat cctgacagct ggcagctggc agctggcagc tggcagcagg cagggctgtg 
1441 ctgtgctgtg ctgtgctgcg ctgtgctgtg ctgtgcagcc agagctgcgt atcaaagcca 
1501 gcaccaggac aggctcttat ggacatgtgt acagtagaat cttctgtgtt tcttcaagtt 
1561 tttacttggt gactttttgt atttaagttt taagacttta ttttctcact atggatgtac 
30 1621 cttataaatg tatttgaaag ctaaatatat tttaaatatt gtatgggagg tgtaaggctg 
1681 ttgtattcag accatgtagg cctcagatta gctggacttg agttttgact aaggatgaca 
1741 ttaattgtta gctgatttga aattatatat ataaatataa aaatatatat aaatttatgc 
1801 cagtctcggc tgaaaggttt tatttacaat agttttatat ctgtgcaatg tttaatgctg 
1861 gcacagcata tgaaacaaca actgcccagc aatgcagttt gttacatgaa ccatattgta 
35 1921 gagttacact tcgggctgca aacccgtgaa caagggaaac agagcgttct gttgatttgt 
1981 aagttatttt tttaataaag attttgtttc ctaaaacctc atgtgtgctt tgggtttgaa 
2041 agcatgtttg ggctttggat ggcatgcata gcactaacac atggaaaaac atgtcagaca 
2101 caggcaagaa gaagatgttc agtgggagag aacggccaga gagcccaacc atgggactgg 
2161 gtggatggat ctgagcaggg tgggtgctat tccttggttc ctgaaggaac attttcaaaa 
40 2221 acatttgagt tcaggggagg atgagcaggt cagaggtagg tgcatttcaa tggctgttga 
2281 gtgaaacttg ctgtaaaagt tgtgctgccc actgtctact caggaaatca tgtccttatc 
2341 ttaacaaaca tctgcctgcc gtcttcccca gccacatgtc catcagacca tcctgctgag 
2401 ggcatgaggt taggtgtcag cgagctgtgt atgctgtatg actctccaag cccaaagcac 
2461 cttcaggccc agcactctag agtcagaggc aggtggatct ctgtgaggtt gaggccaggg 
45 2521 ccaacctgat ctacacagtg agttccaggc tagccaggcc tgtgtagtaa gtctgtgtct 
2581 g 

SEQ ID NO: 225 

Amino acid sequence of mouse RDC1 encoded by the DNA sequence shown in SEQ ID NO: 
224. 

'50 MDVHLFDYAEPGNYSDINWPCNSSDCIVVDTVQ 
ANSVVVWVNIQMCTTGYDTHCYILmA^ 

HLIPSINLFOSIFPIjACMSVDRYLSITYFTGTSSYKKKMVRRWCILVWLLAFFVSLPDT 
Y YLKTVTSASNNETYCRS FYPEHS I KEWLIGMELVS VI LGPAVPFTI I AI FYFLLARAMS 
ASGDQEKHSSRKI I FS YVWFLVCWLPYHF WLLD I FS I LHY I PFTCQLENVLFTALHVT 
55 QCLSLVHCCVNPVLYSPINRNYRYELMKAFIFKYSAKTGLTKLIDASRVSBTEYSALEQN 
TK 

SEQ ID NO: 226 

gi|16758073|ref|NM_053352.1| Rattus norvegicus chemokine orphan receptor 1 (Rdc1), 
mRNA 

1 cactgaagga gcctgcagca ctcaaagcca ggaaggcaaa tcacagccca ggcagccgcg 

61 aggtcacttg gttgctctcc tcaagaccat ggatgtgcat ctgtttgact atgtggaacc 
121 tgggaactac tcggacatca actggccctg taacagtagc gactgcatcg tcgtggacac 
181 cgtgcagtgt cccgccatgc ccaacaagaa tgtgctgctg tataccctct ccttcatcta 
241 cattttcatc ttcgtgatcg gtatgattgc caactccgtg gtggtctggg tgaatatcca 
10 301 ggccaagact acaggctacg acacacactg ctacatcttg aacctggcca ttgccgatct 
361 gtgggtcgtc atcaccatcc ctgtctgggt ggtcagtctc gtgcagcata accagtggcc 
421 catgggtgag ctcacgtgca agatcacaca cctcattttc tccatcaacc tctttgggag 
481 catcttcttc ctcgcatgca tgagcgtgga ccgctatctc tccatcacct acttcaccag 
541 cacctccagc tataagaaga agatggtacg ccgtgttgtc tgcgtcttgg tgtggctgct 
15 601 ggccttcttt gtgtccctgc ctgacaccta ctacctgaag acggtcacat ctgcttccaa 
661 caacgagacc tactgcaggt ccttctaccc cgagcacagc atcaaggagt ggctcattgg 
721 catggagctg gtctccgtca tcttgggttt tgctgtcccc ttcaccatca ttgctatctt 
781 ctacttcctg ctcgccagag ccatgtcagc atccggtgac caggagaaac acagcagccg 
841 gaagatcatc ttctcctacg tggtggtctt cctggtgtgt tggctgccgt accattttgt 
20 901 ggttctgctg gacatcttct ctatcttgca ctacatcccg ttcacctgcc aactggagaa 
961 tgtgctcttt acagcgctgc acgtcacgca gtgcctgtcc ctggtgcact gctgtgtcaa 
1021 ccctgtgctc tacagcttca tcaaccgaaa ctacaggtac gagctgatga aggccttcat 
1081 cttcaagtac tcagccaaaa caggactcac caaactcatc gatgcctcca gagtgtcaga 
1141 gacagagtac tctgccctgg agcagaacac caagtgaccg tgctatagag gcatggggac 
25 1201 atgtgcatgt tgcaaatggg gcagctgggc cctgtggttt cttcaagaaa gcactgtagc 
1261 tttgggtctg gttgcttgag tggtatgaag aggagaagga ccatgcttcc tgtctctcat 
1321 tctccagcca agatggctgt cacctggaca ctcatcctga cagcttgcag cagacaggct 
1381 gccctgcgct gtgcagccag agctgtgtgt caaagccagc atcaggacag actcttctgg 
1441 acacgtgtac aatagaatct tttgtgtttc ttcaagtttt tacttggtga cttttgtatt 
30 1501 taagttttaa gactttattt tctcactatg gatgtacctt ataaatgcat ttgaaagcta 
1561 aatatatttt aaatattgta tgggaggtgt aaggctgttg tatttggacc atgtaggcct 
1621 cagattagca ggactctgag ttttgactaa ggatgacatt cattgttagc tgattcgaaa 
1681 tgatataaat atatataaaa tataaatata tatataaatt tatgccagtc ctggctggaa 
1741 tgttttattt acaatagttt tatatctgtg tggtgtttaa tgctggcaca gcatatggaa 
35 1801 cggaaactgc ccagcaatgc agtttgtgac acgaaccgta tgtagagtta cgtttcgggc 
1861 tgcaaacccg tgaacagagg aaacagcatt ctcttgattg taagttattt ttttaataaa 
1921 gatttttgtt tcctaaaaaa aaaaaaaaaa aaaaaaaa 

SEQ ID NO: 227 

Amino acid sequence of rat RDC1 encoded by the DNA sequence shown in SEQ ID NO: 226. 

40 MDVHLFDYVEPGNYSDINWPCNSSDCIVVDTVQCPAMPNKNVLLYTLSFIYIFIFVIGMI 
ANSWVWVNIQAKTTGYDTHCYILNI^ 

HLIPSINLFGSIFFIACMSVDRYLSITYFTSTSSYKKKMVRRWC^LVWLIjAFFVSLPDT 
YYLKTVTSASN1TCTYCRSFYPEKSIKEWLIGMBLVSVIIX3FAVPFTIIAIFYFLLARAMS 
ASGDQEKHSSRKIIFSYVWFLVCWLPYHFVVLLDIFSILHY1PFTCQLENVLFTALHVT 
45 QCLSLVHCCVNPVLYSFINRNYRYELMKAFIFKYSAKTGLTKLIDASRVSETBYSALBQN 
TK 


SEQ ID NO: 228 

gi|4759145|ref|NM_004787.1| Homo sapiens slit homolog 2 (Drosophila) (SLIT2), mRNA 
1 cagagcaggg tggagagggc ggtgggaggc gtgtgcctga gtgggctcta ctgccttgtt 
61 ccatattatt ttgtgcacat tttccctggc actctgggtt gctagccccg ccgggcactg 
121 ggcctcagac actgcgcggt tccctcggag cagcaagcta aagaaagccc ccagtgccgg 
181 cgaggaagga ggcggcgggg aaagatgcgc ggcgttggct ggcagatgct gtccctgtcg 
5 241 ctggggttag tgctggcgat cctgaacaag gtggcaccgc aggcgtgccc ggcgcagtgc 
301 tcttgctcgg gcagcacagt ggactgtcac gggctggcgc tgcgcagcgt gcccaggaat 
361 atcccccgca acaccgagag actggattta aatggaaata acatcacaag aattacgaag 
421 acagattttg ctggtcttag acatctaaga gttcttcagc ttatggagaa taagattagc 
481 accattgaaa gaggagcatt ccaggatctt aaagaactag agagactgcg tttaaacaga 
10 541 aatcaccttc agctgtttcc tgagttgctg tttcttggga ctgcgaagct atacaggctt 
601 gatctcagtg aaaaccaaat tcaggcaatc ccaaggaaag ctttccgtgg ggcagttgac 
661 ataaaaaatt tgcaactgga ttacaaccag atcagctgta ttgaagatgg ggcattcagg 
721 gctctccggg acctggaagt gctcactctc aacaataaca acattactag actttctgtg 
781 gcaagtttca accatatgcc taaacttagg acttttcgac tgcattcaaa caacctgtat 
15 841 tgtgactgcc acctggcctg gctctccgac tggcttcgcc aaaggcctcg ggttggtctg 
901 tacactcagt gtatgggccc ctcccacctg agaggccata atgtagccga ggttcaaaaa 
961 cgagaatttg tctgcagtgg tcaccagtca tttatggctc cttcttgtag tgttttgcac 
1021 tgccctgccg cctgtacctg tagcaacaat atcgtagact gtcgtgggaa aggtctcact 
1081 gagatcccca caaatcttcc agagaccatc acagaaatac gtttggaaca gaacacaatc 
20 1141 aaagtcatcc ctcctggagc tttctcacca tataaaaagc ttagacgaat tgacctgagc 
1201 aataatcaga tctctgaact tgcaccagat gctttccaag gactacgctc tctgaattca 
1261 cttgtcctct atggaaataa aatcacagaa ctccccaaaa gtttatttga aggactgttt 
1321 tccttacagc tcctattatt gaatgccaac aagataaact gccttcgggt agatgctttt 
1381 caggatctcc acaacttgaa ccttctctcc ctatatgaca acaagcttca gaccatcgcc 
25 1441 aaggggacct tttcacctct tcgggccatt caaactatgc atttggccca gaaccccttt 
1501 atttgtgact gccatctcaa gtggctagcg gattatctcc ataccaaccc gattgagacc 
1561 agtggtgccc gttgcaccag cccccgccgc ctggcaaaca aaagaattgg acagatcaaa 
1621 agcaagaaat tccgttgttc aggtaaagaa cagtatttca ttccaggtac agaagattat 
1681 cgatcaaaat taagtggaga ctgctttgcg gatctggctt gccctgaaaa gtgtcgctgt 
30 1741 gaaggaacca cagtagattg ctctaatcaa aagctcaaca aaatcccgga gcacattccc 
1801 cagtacactg cagagttgcg tctcaataat aatgaattta ccgtgttgga agccacagga 
1861 atctttaaga aacttcctca attacgtaaa ataaacttta gcaacaataa gatcacagat 
1921 attgaggagg gagcatttga aggagcatct ggtgtaaatg aaatacttct tacgagtaat 
1981 cgtttggaaa atgtgcagca taagatgttc aagggattgg aaagcctcaa aactttgatg 
35 2041 ttgagaagca atcgaataac ctgtgtgggg aatgacagtt tcataggact cagttctgtg 
2101 cgtttgcttt ctttgtatga taatcaaatt actacagttg caccaggggc atttgatact 
2161 ctccattctt tatctactct aaacctcttg gccaatcctt ttaactgtaa ctgctacctg 
2221 gcttggttgg gagagtggct gagaaagaag agaattgtca cgggaaatcc tagatgtcaa 
2281 aaaccatact tcctgaaaga aatacccatc caggatgtgg ccattcagga cttcacttgt 
40 2341 gatgacggaa atgatgacaa tagttgctcc ccactttctc gctgtcctac tgaatgtact 
2401 tgcttggata cagtcgtccg atgtagcaac aagggtttga aggtcttgcc gaaaggtatt 
2461 ccaagagatg tcacagagtt gtatctggat ggaaaccaat ttacactggt tcccaaggaa 
2521 ctctccaact acaaacattt aacacttata gacttaagta acaacagaat aagcacgctt 
2581 tctaatcaga gcttcagcaa catgacccag ctcctcacct taattcttag ttacaaccgt 
45 2641 ctgagatgta ttcctcctcg cacctttgat ggattaaagt ctcttcgatt actttctcta 
2701 catggaaatg acatttctgt tgtgcctgaa ggtgctttca atgatctttc tgcattatca 
2761 catctagcaa ttggagccaa ccctctttac tgtgattgta acatgcagtg gttatccgac 
2821 tgggtgaagt cggaatataa ggagcctgga attgctcgtt gtgctggtcc tggagaaatg 
2881 gcagataaac ttttactcac aactccctcc aaaaaattta cctgtcaagg tcctgtggat 
50 2941 gtcaatattc tagctaagtg taacccctgc ctatcaaatc cgtgtaaaaa tgatggcaca 
3001 tgtaatagtg atccagttga cttttaccga tgcacctgtc catatggttt caaggggcag 
3061 gactgtgatg tcccaattca tgcctgcatc agtaacccat gtaaacatgg aggaacttgc 
3121 cacttaaagg aaggagaaga agatggattc tggtgtattt gtgctgatgg atttgaagga 
3181 gaaaattgtg aagtcaacgt tgatgattgt gaagataatg actgtgaaaa taattctaca 
55 3241 tgtgtcgatg gcattaataa ctacacatgc ctttgcccac ctgagtatac aggtgagttg 
3301 tgtgaggaga agctggactt ctgtgcccag gacctgaacc cctgccagca cgattcaaag 
3361 tgcatcctaa ctccaaaggg attcaaatgt gactgcacac cagggtacgt aggtgaacac 
3421 tgcgacatcg attttgacga ctgccaagac aacaagtgta aaaacggagc ccactgcaca 
3481 gatgcagtga acggctatac gtgcatatgc cccgaaggtt acagtggctt gttctgtgag 
60 3541 ttttctccac ccatggtcct ccctcgtacc agcccctgtg ataattttga ttgtcagaat 
3601 ggagctcagt gtatcgtcag aataaatgag ccaatatgtc agtgtttgcc tggctatcag 
3661 ggagaaaagt gtgaaaaatt ggttagtgtg aattttataa acaaagagtc ttatcttcag 
3721 attccttcag ccaaggttcg gcctcagacg aacataacac ttcagattgc cacagatgaa 
3781 gacagcggaa tcctcctgta taagggtgac aaagaccata tcgcggtaga actctatcgg 
3841 gggcgtgttc gtgccagcta tgacaccggc tctcatccag cttctgccat ttacagtgtg 
3901 gagacaatca atgatggaaa cttccacatt gtggaactac ttgccttgga tcagagtctc 
3961 tctttgtccg tggatggtgg gaaccccaaa atcatcacta acttgtcaaa gcagtccact 
4021 ctgaattttg actctccact ctatgtagga ggcatgccag ggaagagtaa cgtggcatct 
4081 ctgcgccagg cccctgggca gaacggaacc agcttccacg gctgcatccg gaacctttac 
4141 atcaacagtg agctgcagga cttccagaag gtgccgatgc aaacaggcat tttgcctggc 
4201 tgtgagccat gccacaagaa ggtgtgtgcc catggcacat gccagcccag cagccaggca 
4261 ggcttcacct gcgagtgcca ggaaggatgg atggggcccc tctgtgacca acggaccaat 
4321 gacccttgcc ttggaaataa atgcgtacat ggcacctgct tgcccatcaa tgcgttctcc 
4381 tacagctgta agtgcttgga gggccatgga ggtgtcctct gtgatgaaga ggaggatctg 
4441 tttaacccat gccaggcgat caagtgcaag cacgggaagt gcaggctttc aggtctgggg 
4501 cagccctact gtgaatgcag cagtggatac acgggggaca gctgtgatcg agaaatctct 
4561 tgtcgagggg aaaggataag agattattac caaaagcagc agggctatgc tgcttgccaa 
4621 acaaccaaga aggtgtcccg attagagtgc agaggtgggt gtgcaggagg gcagtgctgt 
4681 ggaccgctga ggagcaagcg gcggaaatac tctttcgaat gcactgacgg ctcctccttt 
4741 gtggacgagg ttgagaaagt ggtgaagtgc ggctgtacga ggtgtgtgtc ctaaacacac 
4801 tcccggcagc tctgtctttg gaaaaggttg tatacttctt gaccatgtgg gactaatgaa 
4861 tgcttcatag tggaaatatt tgaaatatat tgtaaaatac agaacagact tatttttatt 
4921 atgagaataa agactttttt tctgcatttg 

SEQ ID NO: 229 

Amino acid sequence of human SLIT2 encoded by the DNA sequence shown in SEQ ID NO: 
228. 

^GVGWQMLSLSLGLVLAILNKVAPQACPAQCSCSGSTVDCHGLALRSVPRNIPRNTERL
DLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHLQLFPE
LLFLGTAKLYRLDLSENQIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVL
TLNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQRPRVGLYTQCMGPS
HLRGHNVAEVQKREFVCSGHQSFMAPSCSVLHCPAACTCSNNIVDCRGKGLTEIPTNLPE
TITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKI
TELPKSLFEGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKLQTIAKGTFSPLR
AIQTMHLAQNPFICDCHLKWLADYLHTNPIETSGARCTSPRRLANKRIGQIKSKKFRCSA
KEQYFIPGTEDYRSKLSGDCFADLACPEKCRCEGTTVDCSNQKLNKIPEHIPQYTAELRL
NNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRLENVQHK
MFKGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLN
LLANPFNCNCYLAWLGEWLRKKRIVTGNPRCQKPYFLKEIPIQDVAIQDFTCDDGNDDNS
CSPLSRCPTECTCLDTVVRCSNKGLKVLPKGIPRDVTELYLDGNQFTLVPKELSNYKHLT
LIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDISVV
PEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSEYKEPGIARCAGPGEMADKLLLTT
PSKKFTCQGPVDVNILAKCNPCLSNPCKNDGTCNSDPVDFYRCTCPYGFKGQDCDVPIHA
CISNPCKHGGTCHLKEGEEDGFWCICADGFEGENCEVNVDDCEDNDCENNSTCVDGINNY
TCLCPPEYTGELCEEKLDFCAQDLNPCQHDSKCILTPKGFKCDCTPGYVGEHCDIDFDDC
QDNKCKNGAHCTDAVNGYTCICPEGYSGLFCEFSPPMVLPRTSPCDNFDCQNGAQCIVRI
NEPICQCLPGYQGEKCEKLVSVNFINKESYLQIPSAKVRPQTNITLQIATDEDSGILLYK
GDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGGN
PKIITNLSKQSTLNFDSPLYVGGMPGKSNVASLRQAPGQNGTSFHGCIRNLYINSELQDF
QKVPMQTGILPGCEPCHKKVCAHGTCQPSSQAGFTCECQEGWMGPLCDQRTNDPCLGNKC
VHGTCLPINAFSYSCKCLEGHGGVLCDEEEDLFNPCQAIKCKHGKCRLSGLGQPYCECSS
GYTGDSCDREISCRGERIRDYYQKQQGYAACQTTKKVSRLECRGGCAGGQCCGPLRSKRR
KYSFECTDGSSFVDEVEKVVKCGCTRCVS

SEQ ID NO: 230 

Amino acid sequence of human SLIT2, a soluble active secreted form derived from SEQ ID 
NO:229. 
QCSCSGSTVDCHGLALRSVPRNIPRNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENK
ISTIERGAFQDLKELERLRLNRNHLQLFPELLFLGTAKLYRLDLSENQIQAIPRKAFRGA
VDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNN
LYCDCHLAWLSDWLRQRPRVGLYTQCMGPSHLRGHNVAEVQKREFVCSGHQSFMAPSCSV
LHCPAACTCSNNIVDCRGKGLTEIPTNLPETITEIRLEQNTIKVIPPGAFSPYKKLRRID
LSNNQISELAPDAFQGLRSLNSLVLYGNKITELPKSLFEGLFSLQLLLLNANKINCLRVD
AFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLADYLHTNPI
ETSGARCTSPRRLANKRIGQIKSKKFRCSAKEQYFIPGTEDYRSKLSGDCFADLACPEKC
RCEGTTVDCSNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKI
TDIEEGAFEGASGVNEILLTSNRLENVQHKMFKGLESLKTLMLRSNRITCVGNDSFIGLS
SVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLRKKRIVTGNPR
CQKPYFLKEIPIQDVAIQDFTCDDGNDDNSCSPLSRCPTECTCLDTVVRCSNKGLKVLPK
GIPRDVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSY
NRLRCIPPRTFDGLKSLRLLSLHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWL
SDWVKSEYKEPGIARCAGPGEMADKLLLTTPSKKFTCQGPVDVNILAKCNPCLSNPCKND
GTCNSDPVDFYRCTCPYGFKGQDCDVPIHACISNPCKHGGTCHLKEGEEDGFWCICADGF
EGENCEVNVDDCEDNDCENNSTCVDGINNYTCLCPPEYTGELCEEKLDFCAQDLNPCQHD
SKCILTPKGFKCDCTPGYVGEHCDIDFDDCQDNKCKNGAHCTDAVNGYTCICPEGYSGLF
CEFSPPMVLPR

SEQ ID NO: 231

gi|30794373|ref|NM_178804.2| Mus musculus slit homolog 2 (Drosophila) (Slit2), mRNA

1 tattcagaac ttaagttgcc cacggatctt ctgctctgct agaaaggctt gaagagcaga 
61 ggaaagacct gtgcttctgt cagctctcct gccccatatc actgttccag attactgtgt 
121 gagcatctcc ccgggtgctg tgggctgcaa gcccagcgcc aggcactggg cctcggacac 
181 tgcccggttt ttacacaacc gaaagctcaa gagaagtcct tcaaagcaag gagtctatta 
241 gggaagatga gtggcattgg ctggcagaca ctgtccctat cgctggggtt agtgttgtcg 
301 atcttgaaca aggtggcgcc gcaggcgtgc ccggcccagt gctcctgttc aggcagcacg 
361 gtggactgtc atgggctggc actgcgcagt gtgcccagga atatcccccg caacaccgag 
421 agactggatt tgaatggaaa taacatcacg aggatcacga agacagattt tgctggtctc 
481 aggcacctca gagttcttca gctcatggag aacagaatca gcaccatcga gaggggagca 
541 ttccaggatc ttaaggagct ggaaagactg cgtttaaaca gaaataacct tcagttgttt 
601 cctgagctgc tgtttctcgg gactgcgaag ctctaccggc ttgatctcag tgaaaatcaa 
661 attcaagcaa ttccaaggaa ggctttccgt ggggcagttg acattaaaaa cctgcaactg 
721 gattacaacc agatcagctg cattgaagat ggggcgttca gagctctacg agatctggaa 
781 gtgctcactc tgaacaataa caatattact agactttcag tggcaagttt caaccatatg 
841 cctaaactta ggacatttcg actccactcg aacaacttgt actgcgactg ccacctagcc 
901 tggctctcag actggcttcg ccaaaggcca cgggtgggct tgtacactca gtgtatgggc 
961 ccatcccacc tgaggggcca caatgtagca gaggttcaaa aacgagagtt tgtctgcagt 
1021 ggtcaccagt cattcatggc tccctcctgc agtgtgctgc actgccccgc tgcttgtacc 
1081 tgtagcaaca acattgtaga ctgccgaggg aaaggtctca ctgagatccc cacaaatctg 
1141 cctgagacca tcacagaaat acgtttggaa cagaactcca tcagggtcat ccctccagga 
1201 gccttctcac catacaaaaa gcttagacga ctagacctga gcaacaacca gatctctgaa 
1261 cttgcaccag atgccttcca aggactgcgc tctctgaatt cacttgtcct gtatggaaat 
1321 aaaatcacag aactcccaaa aagtttattc gaaggactat tttccttgca gctactatta 
1381 ttgaatgcca acaagataaa ctgccttcgg gtagatgctt ttcaggacct gcacaacttg 
1441 aaccttctct ccttatatga caataagctt cagacggttg ccaagggcac cttctcagcc 
1501 ctcagagcca tccaaactat gcatttggcc cagaatcctt tcatttgtga ctgccatctc 
1561 aagtggctag cggattatct ccacaccaac ccaattgaga ccagcggtgc ccgttgcacc 
1621 agcccccgcc gcctggcaaa caaaagaatt ggacagatca aaagcaagaa attccgttgt 
1681 tcaggtacag aagattatcg atcaaaatta agtggagact gctttgcaga cttggcttgt 
1741 cctgagaagt gtcgctgtga agggaccaca gtagactgct ccaatcaaag actcaacaaa 
1801 atccctgacc atattcccca gtacacagca gagctgcgtc tcaataataa tgaattcaca
1861 gtgttagaag ccacgggaat atttaagaaa cttcctcagt tacgtaaaat caactttagc 
1921 aacaataaga tcacggatat cgaggagggt gcatttgaag gcgcgtctgg tgtgaatgaa 
1981 attcttctca ccagtaaccg tttggaaaat gttcagcata agatgttcaa aggactggag 
2041 agcctcaaaa cattgatgct gagaagtaat cgaataagct gtgttgggaa cgacagtttc 
2101 ataggactcg gctctgtgcg tctgctctct ttatatgaca atcaaattac cacagtggca 
2161 ccaggagcat ttgattctct ccattcatta tccactctaa acctcttggc caatcctttc
2221 aactgtaact gtcacctggc atggctggga gaatggctca gaaggaaaag aattgtaaca 
2281 ggaaatcctc gatgccaaaa accctacttc ctgaaggaaa tcccaatcca ggatgtagcc 
2341 attcaggact tcacctgtga tgatggaaat gatgacaata gttgctctcc actctcccgt 
2401 tgtccttctg aatgtacctg cttggataca gtggtacgat gtagcaacaa gggcttgaag
2461 gttttgccta aaggtattcc aaaagatgtc acagagctgt atctggatgg gaaccagttt 
2521 acgctggtcc cgaaggaact ctctaactac aaacatttaa cacttataga cttaagtaac 
2581 aaccgaataa gcaccctttc caatcaaagc ttcagcaaca tgacccagct tctcacctta 
2641 atcctcagtt acaaccgtct gagatgtatc cctccacgaa cctttgatgg attgaagtct 
2701 cttcggttac tgtctttaca tggaaatgac atttctgttg tgcctgaagg tgccttcaat
2761 gacttgtcag ccttgtcaca cttagcgatt ggagccaacc ctctttactg tgattgtaac 
2821 atgcagtggt tatccgactg ggtgaagtcg gaatataagg aacctggaat tgcacgctgt 
2881 gccggccctg gagaaatggc agataaatta ttactcacta ctccctccaa aaaatttaca 
2941 tgtcaaggtc ccgtggatat cactattcaa gccaagtgta atccctgctt atcaaatcca 
3001 tgtaaaaatg atggcacctg taacaatgac cccgttgatt tttatcgatg tacctgccca
3061 tatggattca agggtcagga ctgtgatgtc cccattcatg cttgtatcag taatccatgt 
3121 aaacatggag gaacttgtca cttaaaggaa ggagagaatg ctggattctg gtgcacttgt 
3181 gctgatgggt ttgaaggaga aaactgtgaa gtcaatattg atgattgtga agataatgat 
3241 tgtgaaaata attctacatg cgttgatgga attaacaact acacatgtct ttgcccaccg 

3301 gaatacacag gtgaactgtg tgaggaaaag ctggacttct gtgcacaaga cttgaatccc
3361 tgccagcatg actccaagtg catcctgact ccaaagggat tcaagtgtga ctgcactcca 
3421 ggatacattg gtgagcactg tgacattgac tttgatgact gccaagataa caagtgtaaa 
3481 aacggtgctc actgcacaga tgccgtgaac ggatacacgt gcgtctgtcc tgaaggctac 
3541 agtggcttgt tctgtgagtt ttctccaccc atggtcctcc ctcgcaccag cccctgtgat 

3601 aattttgatt gccagaatgg agcccagtgt atcatcagga taaatgaacc aatatgccag
3661 tgtttgcctg gctacctggg agagaagtgt gagaaattgg tcagtgtgaa ttttgtaaac 
3721 aaagagtcct atcttcagat tccttcagcc aaggttcggc ctcagacaaa catcacactt 
3781 cagattgcca cagatgaaga cagcggcatc ctcttgtata aaggtgacaa agaccacatt 
3841 gccgtggaac tctatagagg gcgagttcga gccagctatg acaccggctc tcatccggct 

3901 tctgccattt acagtgtgga gacaatcaat gatggaaact tccacattgt ggagctactg
3961 accctggatt ccagtctttc cctctctgtg gatggaggaa gccctaaagt catcaccaat
4021 ttgtcaaaac aatctactct gaatttcgac tctccactct atgtaggagg catgcctggg
4081 aaaaataacg tggcatccct gcgccaggcc cctgggcaaa atggcaccag cttccatggc
4141 tgtatccgga acctttacat taacagtgag ctgcaggact tccggaaaat gcctatgcaa 

4201 accggaattc tgcctggctg tgaaccatgc cacaagaaag tatgtgccca tggcatgtgc
4261 cagcccagca gccaatcagg cttcacctgt gaatgtgagg aagggtggat ggggcccctc 
4321 tgtgaccaga gaaccaatga tccctgcctc ggaaacaaat gtgtgcatgg gacctgcctg 
4381 cccatcaatg ccttctccta tagttgcaag tgcctggagg gccatggcgg tgtcctctgt 
4441 gatgaagaag aagatctctt taacccctgc cagatgatca agtgcaagca tgggaagtgc 

4501 aggctttctg gagtgggcca gccctattgt gaatgcaaca gtggattcac cggggacagc
4561 tgtgatagag aaatttcttg tcgaggggaa cggataaggg actattacca gaagcagcag 
4621 ggttacgctg cctgtcaaac aactaagaaa gtatctcgct tggaatgcag aggcgggtgc 
4681 gctggaggcc agtgctgtgg acctctgaga agcaagaggc ggaaatactc tttcgaatgc 
4741 acagatggct cctcatttgt ggacgaggtt gagaaagtgg tgaagtgcgg ctgcgcgaga 

4801 tgtgcctcct aagcgcgtct ctagaagctt ctagcttcgg cgaaggttgt acacttcttg
4861 accatgttgg actaattcat gcttcataat ggaaatattt gaaatatatt gtaaaataca 
4921 gaacagactt atttttatta tgataataaa gacttgtctg catttggaaa aaaataataa 
4981 taaaagccac gcttgtacta aagctccccc tacactggag aagtgtggag aaaagccaca 
5041 cttggaggca ttagtgaagc ggtgggtacc attgcaacac ggagccatct ttggaacacg 

5101 ccaatactag cagaagcaca tctacaagag cctgacatgg gactgtacgc tcatatgcgc
5161 ttgcctagag cgtcaaaccc gtgaccccct tcacatcagt tcccaaggac acatcgagca 
5221 caagtgtgtg agtgagactg agaggtagat gaatgggaat ctagaatctg tagataggtc 
5281 caagggatga gatatatttt gtgcaagatc taaagtgtcc tgacatctgg gtttcattcc 
5341 gaaagaaatg ggtgtggagt gctgtgatgt attttattgt ctttggcaat gacagcgcat 

5401 aggcagaaac agcacacgaa ggtgtttagc taccagtttc cagtattaaa ttttttgtaa
5461 tataaatgac aaaggagatg ataaagaacc aatagattat tgataaataa aaattagtaa
5521 taatatgaat ttttgtttct atgagttcta aatatcccta tacagtattc attttcaatg 
5581 aggaatattt attgtattaa agtacatttt acttaacgat ttagatcatc attttggact 
5641 gtttcttgat gctttaatat atattaattt ataattatcc tgatattttt gtacattttc 

5701 aaactttaaa aaaaaatcag gatttttttg gcaatagtac taacatagga tgttattttg
5761 ttatctcggg ataagtatat gttgttctgt tggctgacct tgatgtctga ccattgatgc 
5821 cactgaactg tggcctcctt caggacactt gcagaaggtt tgcaaagtct cagagtagaa
5881 acagcaagtg aatctatctg ccacatgtcc tcacaagaag aaagaactta tctgtagcaa
5941 tggccagtaa gaagtagatg ttttacaaat taattaataa acatttccct gccatgcttt
6001 gttggggagg gtgggagaag aaagactgtt atgtcaaaaa ttgtaatttc tttccaaaca
6061 ataaaattcc tttattaacg taa

SEQ ID NO: 232

Amino acid sequence of mouse SLIT2 encoded by the DNA sequence shown in SEQ ID NO:
231.

MSGIGWQTLSLSLGLVLSILNKVAPQACPAQCSCSGSTVDCHGLALRSVPRNIPRNTERL
DLNGNNITRITKTDFAGLRHLRVLQLMENRISTIERGAFQDLKELERLRLNRNNLQLFPE
LLFLGTAKLYRLDLSENQIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVL
TLNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQRPRVGLYTQCMGPS
HLRGHNVAEVQKREFVCSGHQSFMAPSCSVLHCPAACTCSNNIVDCRGKGLTEIPTNLPE
TITEIRLEQNSIRVIPPGAFSPYKKLRRLDLSNNQISELAPDAFQGLRSLNSLVLYGNKI
TELPKSLFEGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKLQTVAKGTFSALR
AIQTMHLAQNPFICDCHLKWLADYLHTNPIETSGARCTSPRRLANKRIGQIKSKKFRCSG
TEDYRSKLSGDCFADLACPEKCRCEGTTVDCSNQRLNKIPDHIPQYTAELRLNNNEFTVL
EATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRLENVQHKMFKGLESL
KTLMLRSNRISCVGNDSFIGLGSVRLLSLYDNQITTVAPGAFDSLHSLSTLNLLANPFNC
NCHLAWLGEWLRRKRIVTGNPRCQKPYFLKEIPIQDVAIQDFTCDDGNDDNSCSPLSRCP
SECTCLDTVVRCSNKGLKVLPKGIPKDVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNR
ISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDISVVPEGAFNDL
SALSHLAIGANPLYCDCNMQWLSDWVKSEYKEPGIARCAGPGEMADKLLLTTPSKKFTCQ
GPVDITIQAKCNPCLSNPCKNDGTCNNDPVDFYRCTCPYGFKGQDCDVPIHACISNPCKH
GGTCHLKEGENAGFWCTCADGFEGENCEVNIDDCEDNDCENNSTCVDGINNYTCLCPPEY
TGELCEEKLDFCAQDLNPCQHDSKCILTPKGFKCDCTPGYIGEHCDIDFDDCQDNKCKNG
AHCTDAVNGYTCVCPEGYSGLFCEFSPPMVLPRTSPCDNFDCQNGAQCIIRINEPICQCL
PGYLGEKCEKLVSVNFVNKESYLQIPSAKVRPQTNITLQIATDEDSGILLYKGDKDHIAV
ELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLTLDSSLSLSVDGGSPKVITNLS
KQSTLNFDSPLYVGGMPGKNNVASLRQAPGQNGTSFHGCIRNLYINSELQDFRKMPMQTG
ILPGCEPCHKKVCAHGMCQPSSQSGFTCECEEGWMGPLCDQRTNDPCLGNKCVHGTCLPI
NAFSYSCKCLEGHGGVLCDEEEDLFNPCQMIKCKHGKCRLSGVGQPYCECNSGFTGDSCD
REISCRGERIRDYYQKQQGYAACQTTKKVSRLECRGGCAGGQCCGPLRSKRRKYSFECTD
GSSFVDEVEKVVKCGCARCAS

SEQ ID NO: 233

ENSRNOT00000005477 cDNA sequence, EnsEMBL transcript [Rattus norvegicus] 

1 atgagtggca ttggctggca gacactgtcc ctatctctgg cgttagtgtt gtcgatcttg 
61 aaccaggtgg cgcctcaggc gtgcccggcc cagtgctcct gttcaggcag cacagtggac 
121 tgtcatgggc tggcactgcg cagtgtgccc aggaatatcc cccgcaacac ggagagactg 

181 gatttgaatg gaaataacat cacaaggatc acgaagacag attttgcggg tctcagacac
241 ctcagagttc ttcagctcat ggagaacaag atcagcacca tcgagagggg agcattccag 
301 gatcttaagg agctagaaag actgcgttta aacagaaata accttcagtt gtttcctgag 
361 ctgctgtttc ttgggactgc gaagctctac cggcttgatc tcagtgaaaa tcagattcaa 
421 gcaattccaa ggaaggcttt ccgtggtgca gttgacatta aaaatctgca gttggattac 

481 aaccagatca gctgcattga agatggggca ttccgagctc tgcgagatct ggaagtgctc
541 actctgaaca ataacaatat tactagactt tcagtggcaa gtttcaacca tatgcctaaa 
601 cttaggacat ttcgactcca ctccaacaac ctatactgcg actgccacct ggcctggctc 
661 tcggactggc ttcgccaaag gccacgggtg ggcttgtaca ctcagtgtat gggcccatcc 
721 cacctgaggg gccataatgt agcagaggtt caaaaacgag agtttgtctg cagtggtcac 

781 cagtcattca tggctccctc ctgcagtgtg ctgcactgcc cgattgcttg tacctgtagc
841 aacaacattg tagactgccg agggaaaggt ctcactgaga tccccacaaa tctgcctgag 
901 accatcacag aaatacgttt ggaacagaac tccataaggg tcatccctcc aggagcattc 
961 tcaccataca aaaagcttcg acgactagac ctgagtaata accagatctc ggaacttgct 
1021 ccagatgcct tccaaggact gcgttctctg aattcccttg tcctgtatgg aaataaaatc
1081 acagaactcc caaaaagttt atttgaagga ctgttttcct tacagctact attattgaat 
1141 gccaacaaga taaactgcct tcgggtagat gcttttcagg acctgcacaa cttgaacctt 
1201 ctctccttat acgacaataa gcttcagact gttgccaagg gcaccttctc agctctcaga 
1261 gccatccaaa ctatgcattt ggcccagaat cctttcattt gtgactgcca tctcaagtgg
1321 ctagcggatt atctccacac caacccaatt gagaccagcg gtgcccgttg caccagtccc 
1381 cgccgcctgg ctaacaaaag aattggacag atcaaaagca agaaattccg ttgttcaggt 
1441 acagaagatt atcgatcaaa attaagtgga gactgctttg cagacttggc ttgtcctgaa 
1501 aaatgtcgct gtgaagggac cacagtagac tgctccaatc aaaaactcaa caaaatccca 
1561 gaccatattc cccagtacac agcagagctg cgtctcaata ataatgaatt cacagtgtta
1621 gaagccacgg gaatatttaa gaaacttcct caattgcgta aaatcaacct tagcaacaat 
1681 aagatcactg atatcgagga gggggcattc gaaggtgcgt ctggtgtgaa tgagattctg 
1741 cttaccagta accgtttgga aaatgttcag cataagatgt tcaaaggatt ggagagcctc 
1801 aaaacattga tgctgagaag taatcgaata agctgtgtgg gaaacgacag tttcacagga 

1861 ctcggttctg tgcgtctgct ctctttatat gacaatcaaa ttaccacagt tgcaccagga
1921 gcatttggta ctctccattc attatctaca ctaaacctct tggccaatcc tttcaactgt 
1981 aactgtcacc tggcatggct tggagaatgg ctcagaagga aaagaattgt aacaggaaat 
2041 cctcgatgcc aaaaacccta cttcttgaag gaaataccaa tccaggatgt agccattcag 
2101 gacttcacct gtgatgacgg aaacgatgat aatagctgct ctccactctc ccgttgtcct 

2161 tcggaatgta cttgcttgga tacagtagta cgatgtagca acaagggctt gaaggtctta
2221 cctaaaggca ttccaagaga tgtcacagaa ctgtatctgg atgggaacca gtttacactg 
2281 gtcccgaagg aactctccaa ctacaaacat ttaacactta tagacttaag taacaacaga 
2341 ataagcaccc tttccaacca aagcttcagc aacatgaccc aacttctcac cttaattctc 
2401 agttacaacc gtctgagatg tatccctcca cggacctttg atggattgaa atctcttcgt 

2461 ttactgtctc tacatggaaa tgacatttct gtcgtgcctg aaggtgcctt tggtgacctt
2521 tcagccttgt cacacttagc aattggagcc aaccctcttt actgtgattg taacatgcag 
2581 tggttatccg actgggtgaa gtcggaatat aaggaacctg gaattgcccg ctgtgccggt 
2641 cccggagaaa tggcagataa attgttactc acaactccct ccaaaaaatt tacatgtcaa 
2701 ggtcctgtgg atgttactat tcaagccaag tgtaacccct gcttgtcaaa tccatgtaaa 

2761 aatgatggca cctgtaacaa tgacccggtg gatttttatc gatgcacctg cccatatggt
2821 ttcaagggcc aggactgtga tgtccccatt catgcctgta tcagtaatcc atgtaaacat 
2881 ggaggaactt gccacttaaa agaaggagag aatgatggat tctggtgtac ttgtgctgat 
2941 gggtttgaag gagaaagctg tgacatcaat attgatgatt gcgaagataa tgattgtgaa 
3001 aataattcta catgcgttga tggaattaac aactacacgt gtctttgccc accggaatac 

3061 acaggcgaac tgtgtgagga aaaactggac ttctgtgcac aagacctgaa tccctgccag
3121 catgactcca agtgcatcct gacgccaaag ggattcaagt gtgactgcac tccgggatac 
3181 attggtgagc actgtgacat cgactttgat gactgccaag ataacaagtg caaaaacggt 
3241 gctcattgca cagatgcagt gaacggatac acatgtgtct gtcctgaagg ctacagtggc 
3301 ttgttctgtg agttttctcc acccatggtc ctccctcgca ccagcccctg tgataatttt 

3361 gattgtcaga atggagccca gtgtatcatc agggtgaatg aaccaatatg ccagtgtttg
3421 cctggctact tgggagagaa gtgtgagaaa ttggtcagtg tgaattttgt aaacaaagag 
3481 tcctatcttc agattccttc agccaaggtt cgacctcaga caaacatcac acttcagatt 
3541 gccacagatg aagacagcgg catcctcttg tacaagggtg acaaggacca cattgctgtg 
3601 gaactctatc gagggcgagt tcgagccagc tatgacaccg gctctcaccc ggcttctgcc 

3661 atttacagtg tggagacaat caatgatgga aacttccaca ttgtagagct actgaccctg
3721 gattcgagtc tttccctctc tgtggatgga ggaagcccta aaatcatcac caatttgtca 
3781 aaacaatcta ctctgaattt cgactctcca ctttacgtag gaggtatgcc tgggaaaaat 
3841 aacgtggctt cgctgcgcca ggcccctggg cagaacggca ccagcttcca tggctgtatc 
3901 cggaaccttt acattaacag tgaactgcag gacttccgga aagtgcctat gcaaaccgga 

3961 attctgcctg gctgtgaacc atgccacaag aaagtgtgtg cccatggcac atgccagccc
4021 agcagccaat caggcttcac ctgtgaatgt gaggaagggt ggatggggcc cctctgtgac 
4081 cagagaacca atgatccctg tctcggaaac aaatgtgtac atgggacctg cttgcccatc 
4141 aacgccttct cctacagctg caagtgcctg gagggccacg gcggggtcct ctgtgatgaa 
4201 gaagaagatc tgtttaaccc ctgccaggtg atcaagtgca agcacgggaa gtgcaggctc 

4261 tctgggctcg ggcagcccta ttgtgaatgc agcagtggat tcaccgggga cagctgtgac
4321 agagaaattt cttgtcgagg ggaacggata agggattatt accaaaagca gcagggttac 
4381 gctgcctgtc aaacgactaa gaaagtatct cgcttggagt gcagaggcgg gtgtgctggg 
4441 gggcagtgct gtggacctct gagaagcaag aggcggaaat actctttcga atgcacagat 
4501 ggatcttcat ttgtggacga ggtcgagaag gtggtgaagt gcggctgcac gagatgtgcc 

4561 tcctaa

SEQ ID NO: 234

Amino acid sequence of rat SLIT2 encoded by the DNA sequence shown in SEQ ID NO: 233.

MSGIGWQTLSLSLALVLSILNQVAPQACPAQCSCSGSTVDCHGLALRSVPRNIPRNTERL
DLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNNLQLFPE
LLFLGTAKLYRLDLSENQIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVL
TLNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQRPRVGLYTQCMGPS
HLRGHNVAEVQKREFVCSGHQSFMAPSCSVLHCPIACTCSNNIVDCRGKGLTEIPTNLPE
TITEIRLEQNSIRVIPPGAFSPYKKLRRLDLSNNQISELAPDAFQGLRSLNSLVLYGNKI
TELPKSLFEGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKLQTVAKGTFSALR
AIQTMHLAQNPFICDCHLKWLADYLHTNPIETSGARCTSPRRLANKRIGQIKSKKFRCSG
TEDYRSKLSGDCFADLACPEKCRCEGTTVDCSNQKLNKIPDHIPQYTAELRLNNNEFTVL
EATGIFKKLPQLRKINLSNNKITDIEEGAFEGASGVNEILLTSNRLENVQHKMFKGLESL
KTLMLRSNRISCVGNDSFTGLGSVRLLSLYDNQITTVAPGAFGTLHSLSTLNLLANPFNC
NCHLAWLGEWLRRKRIVTGNPRCQKPYFLKEIPIQDVAIQDFTCDDGNDDNSCSPLSRCP
SECTCLDTVVRCSNKGLKVLPKGIPRDVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNR
ISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDISVVPEGAFGDL
SALSHLAIGANPLYCDCNMQWLSDWVKSEYKEPGIARCAGPGEMADKLLLTTPSKKFTCQ
GPVDVTIQAKCNPCLSNPCKNDGTCNNDPVDFYRCTCPYGFKGQDCDVPIHACISNPCKH
GGTCHLKEGENDGFWCTCADGFEGESCDINIDDCEDNDCENNSTCVDGINNYTCLCPPEY
TGELCEEKLDFCAQDLNPCQHDSKCILTPKGFKCDCTPGYIGEHCDIDFDDCQDNKCKNG
AHCTDAVNGYTCVCPEGYSGLFCEFSPPMVLPRTSPCDNFDCQNGAQCIIRVNEPICQCL
PGYLGEKCEKLVSVNFVNKESYLQIPSAKVRPQTNITLQIATDEDSGILLYKGDKDHIAV
ELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLTLDSSLSLSVDGGSPKIITNLS
KQSTLNFDSPLYVGGMPGKNNVASLRQAPGQNGTSFHGCIRNLYINSELQDFRKVPMQTG
ILPGCEPCHKKVCAHGTCQPSSQSGFTCECEEGWMGPLCDQRTNDPCLGNKCVHGTCLPI
NAFSYSCKCLEGHGGVLCDEEEDLFNPCQVIKCKHGKCRLSGLGQPYCECSSGFTGDSCD
REISCRGERIRDYYQKQQGYAACQTTKKVSRLECRGGCAGGQCCGPLRSKRRKYSFECTD
GSSFVDEVEKVVKCGCTRCAS
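
The relationship "amino acid sequence ... encoded by the DNA sequence" used throughout this listing can be checked with a short script. The following is only a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first base, as is the case for SEQ ID NO: 233 (which begins with atg and ends with taa). The names CODON_TABLE, clean_listing and translate are illustrative and are not part of the application.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered t, c, a, g.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_listing(text):
    """Keep only the bases, dropping position numbers, spaces and line breaks."""
    return "".join(ch for ch in text.lower() if ch in "acgt")


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First listing line of SEQ ID NO: 233 (assumed to be in frame from base 1).
first_line = "1 atgagtggca ttggctggca gacactgtcc ctatctctgg cgttagtgtt gtcgatcttg"
print(translate(clean_listing(first_line)))
```

Run on the first listing line of SEQ ID NO: 233, the sketch prints the first twenty residues of SEQ ID NO: 234. For sequences with untranslated flanking regions, such as SEQ ID NO: 231, the input would first have to be trimmed to the start codon.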

SEQ ID NO: 235 

gi|23238206|ref|NM_014452.3| Homo sapiens tumor necrosis factor receptor superfamily,
member 21 (TNFRSF21), mRNA 

1 gccaccacgt gtgtccctgc gcccggtggc caccgactca gtccctcgcc gaccagtctg 
61 ggcagcggag gagggtggtt ggcagtggct ggaagcttcg ctatgggaag ttgttccttt 
121 gctctctcgc gcccagtcct cctccctggt tctcctcagc cgctgtcgga ggagagcacc 
181 cggagacgcg ggctgcagtc gcggcggctt ctccccgcct gggcggccgc gccgctgggc
241 aggtgctgag cgcccctaga gcctcccttg ccgcctccct cctctgcccg gccgcagcag 
301 tgcacatggg gtgttggagg tagatgggct cccggcccgg gaggcggcgg tggatgcggc 
361 gctgggcaga agcagccgcc gattccagct gccccgcgcg ccccgggcgc ccctgcgagt 
421 ccccggttca gccatgggga cctctccgag cagcagcacc gccctcgcct cctgcagccg 
481 catcgcccgc cgagccacag ccacgatgat cgcgggctcc cttctcctgc ttggattcct
541 tagcaccacc acagctcagc cagaacagaa ggcctcgaat ctcattggca cataccgcca 
601 tgttgaccgt gccaccggcc aggtgctaac ctgtgacaag tgtccagcag gaacctatgt 
661 ctctgagcat tgtaccaaca caagcctgcg cgtctgcagc agttgccctg tggggacctt 
721 taccaggcat gagaatggca tagagaaatg ccatgactgt agtcagccat gcccatggcc 
781 aatgattgag aaattacctt gtgctgcctt gactgaccga gaatgcactt gcccacctgg
841 catgttccag tctaacgcta cctgtgcccc ccatacggtg tgtcctgtgg gttggggtgt 
901 gcggaagaaa gggacagaga ctgaggatgt gcggtgtaag cagtgtgctc ggggtacctt 
961 ctcagatgtg ccttctagtg tgatgaaatg caaagcatac acagactgtc tgagtcagaa 
1021 cctggtggtg atcaagccgg ggaccaagga gacagacaac gtctgtggca cactcccgtc 
1081 cttctccagc tccacctcac cttcccctgg cacagccatc tttccacgcc ctgagcacat
1141 ggaaacccat gaagtccctt cctccactta tgttcccaaa ggcatgaact caacagaatc 
1201 caactcttct gcctctgtta gaccaaaggt actgagtagc atccaggaag ggacagtccc 
1261 tgacaacaca agctcagcaa gggggaagga agacgtgaac aagaccctcc caaaccttca 
1321 ggtagtcaac caccagcaag gcccccacca cagacacatc ctgaagctgc tgccgtccat 
1381 ggaggccact gggggcgaga agtccagcac gcccatcaag ggccccaaga ggggacatcc
1441 tagacagaac ctacacaagc attttgacat caatgagcat ttgccctgga tgattgtgct 
1501 tttcctgctg ctggtgcttg tggtgattgt ggtgtgcagt atccggaaaa gctcgaggac 
1561 tctgaaaaag gggccccggc aggatcccag tgccattgtg gaaaaggcag ggctgaagaa 
1621 atccatgact ccaacccaga accgggagaa atggatctac tactgcaatg gccatggtat
1681 cgatatcctg aagcttgtag cagcccaagt gggaagccag tggaaagata tctatcagtt 
1741 tctttgcaat gccagtgaga gggaggttgc tgctttctcc aatgggtaca cagccgacca 
1801 cgagcgggcc tacgcagctc tgcagcactg gaccatccgg ggccccgagg ccagcctcgc 
1861 ccagctaatt agcgccctgc gccagcaccg gagaaacgat gttgtggaga agattcgtgg 

1921 gctgatggaa gacaccaccc agctggaaac tgacaaacta gctctcccga tgagccccag
1981 cccgcttagc ccgagcccca tccccagccc caacgcgaaa cttgagaatt ccgctctcct 
2041 gacggtggag ccttccccac aggacaagaa caagggcttc ttcgtggatg agtcggagcc 
2101 ccttctccgc tgtgactcta catccagcgg ctcctccgcg ctgagcagga acggttcctt 
2161 tattaccaaa gaaaagaagg acacagtgtt gcggcaggta cgcctggacc cctgtgactt 

2221 gcagcctatc tttgatgaca tgctccactt tctaaatcct gaggagctgc gggtgattga
2281 agagattccc caggctgagg acaaactaga ccggctattc gaaattattg gagtcaagag 
2341 ccaggaagcc agccagaccc tcctggactc tgtttatagc catcttcctg acctgctgta 
2401 gaacataggg atactgcatt ctggaaatta ctcaatttag tggcagggtg gttttttaat 
2461 tttcttctgt ttctgatttt tgttgtttgg ggtgtgtgtg tgtgtttgtg tgtgtgtgtg 

2521 tgtgtgtgtg tgtgtgtgtg tttaacagag aatatggcca gtgcttgagt tctttctcct
2581 tctctctctc tctttttttt ttaaataact cttctgggaa gttggtttat aagcctttgc 
2641 caggtgtaac tgttgtgaaa tacccaccac taaagttttt taagttccat attttctcca 
2701 ttttgccttc ttatgtattt tcaagattat tctgtgcact ttaaatttac ttaacttacc 
2761 ataaatgcag tgtgactttt cccacacact ggattgtgag gctcttaact tcttaaaagt 

2821 ataatggcat cttgtgaatc ctataagcag tctttatgtc tcttaacatt cacacctact
2881 ttttaaaaac aaatattatt actattttta ttattgtttg tcctttataa attttcttaa 
2941 agattaagaa aatttaagac cccattgagt tactgtaatg caattcaact ttgagttatc 
3001 ttttaaatat gtcttgtata gttcatattc atggctgaaa cttgaccaca ctattgctga 
3061 ttgtatggtt ttcacctgga caccgtgtag aatgcttgat tacttgtact cttcttatgc 

3121 taatatgctc tgggctggag aaatgaaatc ctcaagccat caggatttgc tatttaagtg
3181 gcttgacaac tgggccacca aagaacttga acttcacctt ttaggatttg agctgttctg 
3241 gaacacattg ctgcactttg gaaagtcaaa atcaagtgcc agtggcgccc tttccataga 
3301 gaatttgccc agctttgctt taaaagatgt cttgtttttt atatacacat aatcaatagg 
3361 tccaatctgc tctcaaggcc ttggtcctgg tgggattcct tcaccaatta ctttaattaa 

3421 aaatggctgc aactgtaaga acccttgtct gatatatttg caactatgct cccatttaca
3481 aatgtacctt ctaatgctca gttgccaggt tccaatgcaa aggtggcgtg gactcccttt 
3541 gtgtgggtgg ggtttgtggg tagtggtgaa ggaccgatat cagaaaaatg ccttcaagtg 
3601 tactaattta ttaataaaca ttaggtgttt gttaaaaaaa aaaaaaaaaa aaaaaaaaaa 
3661 aa 

SEQ ID NO: 236

Amino acid sequence of human TNFRSF21 encoded by the DNA sequence shown in SEQ ID 
NO: 235. 

MGTSPSSSTALASCSRIARRATATMIAGSLLLLGFLSTTTAQPEQKASNLIGTYRHVDRA
TGQVLTCDKCPAGTYVSEHCTNTSLRVCSSCPVGTFTRHENGIEKCHDCSQPCPWPMIEK
LPCAALTDRECTCPPGMFQSNATCAPHTVCPVGWGVRKKGTETEDVRCKQCARGTFSDVP
SSVMKCKAYTDCLSQNLVVIKPGTKETDNVCGTLPSFSSSTSPSPGTAIFPRPEHMETHE
VPSSTYVPKGMNSTESNSSASVRPKVLSSIQEGTVPDNTSSARGKEDVNKTLPNLQVVNH
QQGPHHRHILKLLPSMEATGGEKSSTPIKGPKRGHPRQNLHKHFDINEHLPWMIVLFLLL
VLVVIVVCSIRKSSRTLKKGPRQDPSAIVEKAGLKKSMTPTQNREKWIYYCNGHGIDILK
LVAAQVGSQWKDIYQFLCNASEREVAAFSNGYTADHERAYAALQHWTIRGPEASLAQLIS
ALRQHRRNDVVEKIRGLMEDTTQLETDKLALPMSPSPLSPSPIPSPNAKLENSALLTVEP
SPQDKNKGFFVDESEPLLRCDSTSSGSSALSRNGSFITKEKKDTVLRQVRLDPCDLQPIF
DDMLHFLNPEELRVIEEIPQAEDKLDRLFEIIGVKSQEASQTLLDSVYSHLPDLL

SEQ ID NO: 237 
gi|31341673|ref|NM_178589.2| Mus musculus tumor necrosis factor receptor superfamily,
member 21 (Tnfrsf21), mRNA

1 aggtgtcccc gagctgagtg gccatccgac tcagtccctc gccggccggt ctaggcagcg 
61 gaggaggcga gtgcttgata gtggctggaa gcttcgctat gggaagtcgc tctttagcac 
121 tgtcgcggct agccctgctc tctggttctc cgcagccgct gtcgttggag agcaccggga
181 ggcgcgggtt gcgagcgcgc ctgcttctca ccgcccgggc gccagcgccc ctgggcaggt 
241 gctgagcgcc tttcggagcc tcccctgctg cctccctctt ccgcctgggt gcctggctgc 
301 tgcagtgcac atgggctgct ggaggtagat gggctcaccg cccgtgaggc ggcggtggat 
361 gcggcgctgg gcagaaacag ccaccaattc cagctgccgt ggggccgagc gccccgggcg 
421 ccgctgcgag ccccgagctc ggccatgggg acccgggcaa gcagcatcac cgccctcgcc
481 tcttgcagcc gcaccgccgg ccaagtcgga gccacgatgg tcgccggctc tcttctcctg 
541 cttggattcc tcagcaccat cacagctcaa ccagaacaaa agactctgag tctccctggc 
601 acctaccgcc atgttgaccg taccactggc caggtgctaa cctgcgacaa gtgcccagca 
661 ggaacgtacg tctccgagca ctgtaccaac atgagcctgc gagtctgcag cagctgcccc 
721 gcggggacct ttaccaggca cgagaacggc atagagagat gccatgactg tagtcagcca
781 tgtccatggc cgatgattga gagattacct tgtgctgcct tgactgaccg agagtgcatc 
841 tgcccacctg gaatgtatca gtctaatggt acctgcgctc cccatacagt gtgccccgtg 
901 ggctggggtg tgcggaagaa agggacagag aatgaagatg tgcgctgtaa gcagtgcgct 
961 cggggtacct tctctgacgt gccttccagt gtgatgaagt gtaaagctca cacggactgt 
1021 ctgggtcaga acctggaggt ggtcaagcca gggaccaagg agacagacaa cgtctgtggc
1081 atgcgcctgt tcttctccag cacaaaccca ccttcctctg gcacagttac cttttctcac 
1141 cctgagcata tggaatccca cgatgtccct tcctccacct atgagcccca aggcatgaac 
1201 tcaacagatt ccaactctac tgcctctgtt agaactaagg taccaagtgg catcgaggaa 
1261 gggacagtgc ctgacaatac gagctcaacc agtgggaagg aaggcactaa taggaccctg 
1321 ccaaacccac cacaagttac ccaccagcaa gccccccacc acagacacat tctgaagctg
1381 ctgccatcgt ccatggaggc cacgggtgag aagtccagca cagccatcaa ggcccccaag 
1441 aggggtcacc ccagacagaa cgctcacaag catttcgaca tcaacgagca cttgccttgg 
1501 atgatcgtcc tcttccttct gctggtcctg gtgctgatag tggtgtgcag tatccgaaag 
1561 agctccagga ctctcaaaaa ggggccccgg caggatccca gcgccatagt ggaaaaggcg 
1621 gggctgaaga agtccctgac tcccacccag aaccgggaga aatggatcta ctaccgcaac
1681 ggccatggta ttgacatctt gaagcttgta gcagcccagg tgggaagcca gtggaaggac 
1741 atctatcagt ttctttgcaa cgccagcgag agggaggtgg cggccttctc caatggatac 
1801 actgcagatc atgaacgggc ctacgcggct ctgcagcact ggaccatccg tggccctgag 
1861 gccagccttg cccagctcat tagcgccttg cgccagcacc gacgcaatga tgttgtggag 
1921 aagattcgtg ggctgatgga agacaccacg cagttggaaa cagacaaact ggctctcccc
1981 atgagcccca gtccgcttag cccgagcccc atgcccagtc ctaacgtgaa acttgagaat 
2041 tccactctcc tgacagtgga gccctcaccg ctggacaaga acaagtgctt cttcgtggac 
2101 gagtcagagc cccttctgcg atgcgactcc acatccagtg gctcttcagc actgagcaga 
2161 aacggctcct ttattaccaa agaaaagaag gacacagtgt tgcggcaggt ccgcctggac 
2221 ccctgtgact tgcagcccat ctttgatgac atgctgcata tcctgaaccc cgaggagctg
2281 cgggtgattg aagagattcc ccaggctgag gacaaactgg accgcctctt cgagatcatt 
2341 ggggtcaaga gccaagaagc cagccagacc ctcttggact ctgtgtacag tcatcttcct 
2401 gacctattgt agaacacagg ggcactgcat tctgggaatc aacctactgg cggggtgatt 
2461 tcatttcgtt tctgactttt gtgttttggt gtgtatgtat gtgtttaaca gagtgtatgg 
2521 ccggtgagtt tgggttcttt ctttctttct ttctttcttt ctttctttct ttctttcttt
2581 ctttctttct ttctttcttt cttccttcct gaaagtgaat gtataaagcc tttacaatgt 
2641 ataactgttg gaaaatgccc accactaaat ttttttttaa gttccacata ttctccattt 
2701 ttgccttctt atatatatct tcaacactat tctgtgcact ttaaaaactt aacataaacg 
2761 cagtgtgact tctcccatat gctgggtccc gagactctca acttcttaaa aacctaatgg 
2821 catcttgtga ctcctagaag tagacataag tctttcaacc ttcacaccta ctctttctgt
2881 tttaattatt attgctattt gtcttattgt ttgtgcttta caagcgttct tgaggacgga 
2941 gggaatctac gaccctgttg atgactgtaa ctctattcga ctttgagttg tcttcttcat 
3001 gtcttgttat atagttcata ttcatggctg aaacttgacc atactcccta gcgctgattg 
3061 tatggttttc gtctggacac cgtacactgc ctgataactt gtgcacctct taacgctact 
3121 atgctctggg ctggagaatg aaatctttaa gtcaccagga cttgctgttt cagtggcttg
3181 acacctgggc caccaaagaa ctcgatcttc atcttttagg gacacctctg ctgcaccttg 
3241 gaaagccaac cttaagtgcc agtggcactt tatgcccagc tttgctttga aagatatctt 
3301 tcttgttttt tttatccttc tctttctctc ttttttttaa aaatacacat agtcaatagg 
3361 tccagtctgc cctcaaggcc ttgctgggtt ttcttcatca tccaatcact ttcattaaaa 
3421 atggctgcag ctgtaagaac tcttgtctga taaattttca actatgctct catttatcta
3481 cctgccctct gatgctcagt cgtcagactc taatgcaaag gtggacgtcg gctgcctttg
3541 cgtgggcggg cttagtggtg aggaactgat atcagaaaaa aaatgccttc aagtatacta
3601 atttattaat aaatattagg tgtttgtt

SEQ ID NO: 238

Amino acid sequence of mouse TNFRSF21 encoded by the DNA sequence shown in SEQ ID 
NO: 237. 

MGTRASSITALASCSRTAGQVGATMVAGSLLLLGFLSTITAQPEQKTLSLPGTYRHVDRT
TGQVLTCDKCPAGTYVSEHCTNMSLRVCSSCPAGTFTRHENGIERCHDCSQPCPWPMIER
LPCAALTDRECICPPGMYQSNGTCAPHTVCPVGWGVRKKGTENEDVRCKQCARGTFSDVP
SSVMKCKAHTDCLGQNLEVVKPGTKETDNVCGMRLFFSSTNPPSSGTVTFSHPEHMESHD
VPSSTYEPQGMNSTDSNSTASVRTKVPSGIEEGTVPDNTSSTSGKEGTNRTLPNPPQVTH
QQAPHHRHILKLLPSSMEATGEKSSTAIKAPKRGHPRQNAHKHFDINEHLPWMIVLFLLL
VLVLIVVCSIRKSSRTLKKGPRQDPSAIVEKAGLKKSLTPTQNREKWIYYRNGHGIDILK
LVAAQVGSQWKDIYQFLCNASEREVAAFSNGYTADHERAYAALQHWTIRGPEASLAQLIS
ALRQHRRNDVVEKIRGLMEDTTQLETDKLALPMSPSPLSPSPMPSPNVKLENSTLLTVEP
SPLDKNKCFFVDESEPLLRCDSTSSGSSALSRNGSFITKEKKDTVLRQVRLDPCDLQPIF
DDMLHILNPEELRVIEEIPQAEDKLDRLFEIIGVKSQEASQTLLDSVYSHLPDLL

SEQ ID NO: 239 

gi|34874517|ref|XM_236992.2| Rattus norvegicus similar to death receptor 6 (LOC316256),
mRNA 

1 ccgggaggcg cgggttgcaa gcgcgcctgc ttctccccgc ccgggcgcca gcgcccctgg 
61 gcaggtgctg agcgccttct ccgagcctcc cctgctgcct ccctcttccg cctagccgcc 
121 ttgctgctgc agtgcacatg ggctgctgga ggtagatggg ctcaccgccc gtgaggcggc 
181 ggtggatgcg gcgctgggca gaaacagcca ccgattccag ctgccgtggg gccgagcgcc
241 cggagcgccg ctgcgagccc cgggctcagc catggggacc tccgcaagca gcatcaccgc 
301 cctcgcctct tgcagccgca tcgccggcca agttggagcc acgatggtcg ccggctccct 
361 tctcttgctt gggttcctca gcaccatcac agcccaacca gaacaaaaga ctctgagtct 
421 cacgggcacg taccgccacg ttgaccgtac cactggccag gtgctaacct gcgacaagtg 
481 tccggcagga acgtatgtct ccgagcactg taccaacacg agcctgcgag tctgcagcag
541 ctgcccctcg gggaccttta ccaggcatga gaacggcata gagagatgcc atgactgtag 
601 tcagccatgc ccacggccga tgattgagag attaccttgt gctgccttga ctgaccgaga 
661 atgcatctgc ccacccggaa tgtatcagtc taatgggacc tgcgctcccc acacggtgtg 
721 ccccgtgggc tggggtgtga ggaagaaagg gacagagaat gaagatgtgc ggtgtaagca 
781 gtgtgctcgc ggtaccttct ctgacgtgcc ttccagtgtg atgaagtgta gagcccacac
841 ggactgtctg ggtcagaacc tgatggtggt caagcagggg actaaggaga cagacaacgt 
901 ctgtggcgtg cacctgtcct cctccagcac gaccccatct tcccctggca tagctacctt 
961 ttctcatcct gagcacacgg aatcccacga tgtcccctcc tccacctatg agccccaagg 
1021 catgaactca acagattcca actctactgc ctctgttaga actaaggtac caagtgacat 
1081 ccaggaagag acagtgcctg acaatacaag ctccacgagt gggaaggaga gcacgaacag
1141 gaccctgcca aacccaccac aacttaccca ccagcaaggc ccccaccaca gacacattct 
1201 gaagctgctg ccatcatcca tggaggccac tggtgagaag tccagcacag ccatcaaggc 
1261 ccccaagagg ggccatccca gacagaaccc acacaagcat ttcgacatca acgagcattt 
1321 gccctggatg attgtactct tccttctgct ggtcctggtg ctcattgtgg tgtgcagtat 
1381 ccgaaagagc tctaggactc tcaagaaggg gccccggcag gatcccagtg ccatcatgga
1441 aaaggcaggg ctgaagaagt ccttgactcc aacccagaac cgggagaaat ggatctacta 
1501 ccgcaatggc cacggtattg acatcctgaa gcttgtagca gcgcaggtgg gaagccagtg 
1561 gaaggacatc tatcagtttc tttgcaacgc cagcgagagg gaggtggctg ccttctccaa 
1621 tggatacact gcagaccatg aacgagccta tgcagctctg cagcactgga ccatccgggg 
1681 ccctgaggcc agccttgctc agctcatcag cgccttgcgc cagcaccgac gcaatgatgt
1741 tgtggagaag attcgtgggc tgatggaaga tactacgcag ttggaaaccg acaaactggc 
1801 tctccccatg agccccagtc cgcttagccc gagccccatt cccagtccta acgtgaaact 



1861 tgagaattcc actctcctga cagtggagcc ctcacccctg gacaagaaca agggcttctt
1921 cgtggacgag tcggagcccc ttctgcgttg tgactccacg tccagcggct cttcagcact
1981 gagcaggaat ggctccttta taaccaaaga aaagaaggac acagtgttgc ggcaggtccg
2041 cctggacccc tgtgacttgc agcccatctt tgacgacatg ctgcacatcc tgaatccgga
2101 agagctgcgg gtaatcgaag agattcccca ggctgaggac aaactggacc gtctcttcga
2161 gatcattggg gtcaagagcc aagaagccag ccagaccctc ttggactctg tgtacagcca
2221 tcttcctgac ctattgtag

SEQ ID NO: 240 

Amino acid sequence of rat TNFRSF21 encoded by the DNA sequence shown in SEQ ID 
NO: 239.

MGTSASSITALASCSRIAGQVGATMVAQSLLLLGFLSTITAQPEQKTLSLTGTYRHVDRT
TGQVLTCDKCPAGTYVSEHCTNTSLRVCSSCPSGTFTRHENGIERCHDCSQPCPRPMIER
LPCAALTDRECICPPGMYQSNGTCAPHTVCPVGWGVRKKGTENEDVRCKQCARGTFSDVP
SSVMKCRAHTDCLGQNLMVVKQGTKETDNVCGVHLSSSSTTPSSPGIATFSHPEHTESHD
VPSSTYEPQGMNSTDSNSTASVRTKVPSDIQEETVPDNTSSTSGKESTNRTLPNPPQLTH
QQGPHHRHILKLLPSSMEATGEKSSTAIKAPKRGHPRQNPHKHFDINEHLPWMIVLFLLL
VLVLIVVCSIRKSSRTLKKGPRQDPSAIMEKAGLKKSLTPTQNREKWIYYRNGHGIDILK
LVAAQVGSQWKDIYQFLCNASEREVAAFSNGYTADHERAYAALQHWTIRGPEASLAQLIS
ALRQHRRNDVVEKIRGLMEDTTQLETDKLALPMSPSPLSPSPIPSPNVKLENSTLLTVEP
SPLDKNKGFFVDESEPLLRCDSTSSGSSALSRNGSFITKEKKDTVLRQVRLDPCDLQPIF
DDMLHILNPEELRVIEEIPQAEDKLDRLFEIIGVKSQEASQTLLDSVYSHLPDLL

SEQ ID NO: 241 

gi|23510443|ref|NM_006573.3| Homo sapiens tumor necrosis factor (ligand) superfamily, 
member 13b (TNFSF13B), mRNA 

1 gaaattctta caaaaactga aagtgaaatg aggaagacag attgagcaat ccaatcggag

61 ggtaaatgcc agcaaaccta ctgtacagta ggggtagaga tgcagaaagg cagaaaggag 
121 aaaattcagg ataactctcc tgaggggtga gccaagccct gccatgtagt gcacgcagga 
181 catcaacaaa cacagataac aggaaatgat ccattccctg tggtcactta ttctaaaggc 
241 cccaaccttc aaagttcaag tagtgatatg gatgactcca cagaaaggga gcagtcacgc 

301 cttacttctt gccttaagaa aagagaagaa atgaaactga aggagtgtgt ttccatcctc
361 ccacggaagg aaagcccctc tgtccgatcc tccaaagacg gaaagctgct ggctgcaacc 
421 ttgctgctgg cactgctgtc ttgctgcctc acggtggtgt ctttctacca ggtggccgcc 
481 ctgcaagggg acctggccag cctccgggca gagctgcagg gccaccacgc ggagaagctg 
541 ccagcaggag caggagcccc caaggccggc ctggaggaag ctccagctgt caccgcggga 

601 ctgaaaatct ttgaaccacc agctccagga gaaggcaact ccagtcagaa cagcagaaat
661 aagcgtgccg ttcagggtcc agaagaaaca gtcactcaag actgcttgca actgattgca 
721 gacagtgaaa caccaactat acaaaaagga tcttacacat ttgttccatg gcttctcagc 
781 tttaaaaggg gaagtgccct agaagaaaaa gagaataaaa tattggtcaa agaaactggt 
841 tactttttta tatatggtca ggttttatat actgataaga cctacgccat gggacatcta 

901 attcagagga agaaggtcca tgtctttggg gatgaattga gtctggtgac tttgtttcga
961 tgtattcaaa atatgcctga aacactaccc aataattcct gctattcagc tggcattgca 
1021 aaactggaag aaggagatga actccaactt gcaataccaa gagaaaatgc acaaatatca 
1081 ctggatggag atgtcacatt ttttggtgca ttgaaactgc tgtgacctac ttacaccatg 
1141 tctgtagcta ttttcctccc tttctctgta cctctaagaa gaaagaatct aactgaaaat 

1201 acca

SEQ ID NO: 242 

Amino acid sequence of human TNFSF13B encoded by the DNA sequence shown in SEQ ID 
NO: 241. 

MDDSTEREQSRLTSCLKKREEMKLKECVSILPRKESPSVRSSKDGKLLAATLLLALLSCC
LTVVSFYQVAALQGDLASLRAELQGHHAEKLPAGAGAPKAGLEEAPAVTAGLKIFEPPAP
GEGNSSQNSRNKRAVQGPEETVTQDCLQLIADSETPTIQKGSYTFVPWLLSFKRGSALEE
KENKILVKETGYFFIYGQVLYTDKTYAMGHLIQRKKVHVFGDELSLVTLFRCIQNMPETL
PNNSCYSAGIAKLEEGDELQLAIPRENAQISLDGDVTFFGALKLL

SEQ ID NO: 243

Amino acid sequence of human TNFSF13B, a soluble active secreted form derived from SEQ 
ID NO: 242.

AVQGPEETVTQDCLQLIADSETPTIQKGSYTFVPWLLSFKRGSALEEKENKILVKETGYF
FIYGQVLYTDKTYAMGHLIQRKKVHVFGDELSLVTLFRCIQNMPETLPNNSCYSAGIAKL
EEGDELQLAIPRENAQISLDGDVTFFGALKLL

SEQ ID NO: 244

gi|32454911|gb|AY302751.1| Homo sapiens delta BAFF (TNFSF13b) mRNA, complete cds; 
alternatively spliced 

1 atggatgact ccacagaaag ggagcagtca cgccttactt cttgccttaa gaaaagagaa

61 gaaatgaaac tgaaggagtg tgtttccatc ctcccacgga aggaaagccc ctctgtccga 
121 tcctccaaag acggaaagct gctggctgca accttgctgc tggcactgct gtcttgctgc 
181 ctcacggtgg tgtctttcta ccaggtggcc gccctgcaag gggacctggc cagcctccgg 
241 gcagagctgc agggccacca cgcggagaag ctgccagcag gagcaggagc ccccaaggcc

301 ggcctggagg aagctccagc tgtcaccgcg ggactgaaaa tctttgaacc accagctcca
361 ggagaaggca actccagtca gaacagcaga aataagcgtg ccgttcaggg tccagaagaa 
421 acaggatctt acacatttgt tccatggctt ctcagcttta aaaggggaag tgccctagaa 
481 gaaaaagaga ataaaatatt ggtcaaagaa actggttact tttttatata tggtcaggtt 
541 ttatatactg ataagaccta cgccatggga catctaattc agaggaagaa ggtccatgtc

601 tttggggatg aattgagtct ggtgactttg tttcgatgta ttcaaaatat gcctgaaaca
661 ctacccaata attcctgcta ttcagctggc attgcaaaac tggaagaagg agatgaactc 
721 caacttgcaa taccaagaga aaatgcacaa atatcactgg atggagatgt cacatttttt 
781 ggtgcattga aactgctgtg a 

SEQ ID NO: 245

Amino acid sequence of human TNFSF13B variant ORF number 1 encoded by the DNA 
sequence shown in SEQ ID NO: 244.

MDDSTEREQSRLTSCLKKREEMKLKECVSILPRKESPSVRSSKDGKLLAATLLLALLSCC
LTVVSFYQVAALQGDLASLRAELQGHHAEKLPAGAGAPKAGLEEAPAVTAGLKIFEPPAP
GEGNSSQNSRNKRAVQGPEETGSYTFVPWLLSFKRGSALEEKENKILVKETGYFFIYGQV
LYTDKTYAMGHLIQRKKVHVFGDELSLVTLFRCIQNMPETLPNNSCYSAGIAKLEEGDEL
QLAIPRENAQISLDGDVTFFGALKLL
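
As an illustration only, and not part of the application as filed, the relationship between a coding DNA sequence and the amino acid sequence it encodes can be sketched with the standard genetic code. The Python sketch below translates the first 60 nucleotides of SEQ ID NO: 244; the function name `translate` and the variable names are hypothetical, and the sketch assumes the reading frame starts at the first nucleotide supplied.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1 up to the first stop codon.

    Position numbers, spaces and any other non-nucleotide characters found in
    a printed sequence listing are dropped before translation.
    """
    seq = "".join(ch for ch in dna.lower() if ch in BASES)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First line (nucleotides 1-60) of SEQ ID NO: 244 as printed above.
first_line = "atggatgact ccacagaaag ggagcagtca cgccttactt cttgccttaa gaaaagagaa"
print(translate(first_line))  # MDDSTEREQSRLTSCLKKRE
```

The 20 residues printed correspond to the opening residues of SEQ ID NO: 245. For a sequence whose open reading frame does not begin at nucleotide 1, such as the mRNA entries with 5' untranslated regions, the input would first have to be trimmed to the start codon.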

SEQ ID NO: 246 

gi|32441946|gb|AY290823.1| Mus musculus delta BAFF (Tnfsf13b) mRNA, complete cds; 
alternatively spliced 

1 atggatgagt ctgcaaagac cctgccacca ccgtgcctct gtttttgctc cgagaaagga

61 gaagatatga aagtgggata tgatcccatc actccgcaga aggaggaggg tgcctggttt 
121 gggatctgca gggatggaag gctgctggct gctaccctcc tgctggccct gttgtccagc 
181 agtttcacag cgatgtcctt gtaccagttg gctgccttgc aagcagacct gatgaacctg
241 cgcatggagc tgcagagcta ccgaggttca gcaacaccag ccgccgcggg tgctccagag
301 ttgaccgctg gagtcaaact cctgacaccg gcagctcctc gaccccacaa ctccagccgc
361 ggccacagga acagacgcgc tttccaggga ccagaggaaa cagaacaaga tgtagacctc 
421 tcagctcctc ctgcaccatg cctgcctgga tgccgccatt ctcaacatga tgataatgga 
481 atgaacctca gaaacagaac ttacacattt gttccatggc ttctcagctt taaaagagga 
541 aatgccttgg aggagaaaga gaacaaaata gtggtgaggc aaacaggcta tttcttcatc
601 tacagccagg ttctatacac ggaccccatc tttgctatgg gtcatgtcat ccagaggaag
661 aaagtacacg tctttgggga cgagctgagc ctggtgaccc tgttccgatg tattcagaat 
721 atgcccaaaa cactgcccaa caattcctgc tactcggctg gcatcgcgag gctggaagaa 
781 ggagatgaga ttcagcttgc aattcctcgg gagaatgcac agatttcacg caacggagac 
841 gacaccttct ttggtgccct aaaactgctg taa

SEQ ID NO: 247 

Amino acid sequence of mouse TNFSF13B encoded by the DNA sequence shown in SEQ ID 
NO: 246. 

MDESAKTLPPPCLCFCSEKGEDMKVGYDPITPQKEEGAWFGICRDGRLLAATLLLALLSS
SFTAMSLYQLAALQADLMNLRMELQSYRGSATPAAAGAPELTAGVKLLTPAAPRPHNSSR
GHRNRRAFQGPEETEQDVDLSAPPAPCLPGCRHSQHDDNGMNLRNRTYTFVPWLLSFKRG
NALEEKENKIVVRQTGYFFIYSQVLYTDPIFAMGHVIQRKKVHVFGDELSLVTLFRCIQN
MPKTLPNNSCYSAGIARLEEGDEIQLAIPRENAQISRNGDDTFFGALKLL

SEQ ID NO: 248 

ENSRNOT00000019397 cDNA sequence, EnsEMBL transcript [Rattus norvegicus]

1 atggataagt ctgcaaagaa cctgccacca ccgcgcctct gtttttgccc tgagaaagga 
61 gaagatatga aagtgggata tgttcccatc actccgcaga aggagggtgc ctgggttggg 
121 atctgcaggg acagacggct gctggctgct accctcctgc tggctctgtt gtccagcagt 
181 ttcacagcga tgtccttgta ccagttggct gtcctgcaag cagacctgat gagcctgcgc 

241 atggagctgc agagctaccg aagttcagcg acacccgccg ccccgggggc tccagggttg
301 tccgcgggag tcaaactccc aacacccgca gctcctggac cccacaactc cagccgaggc 
361 caaaggaaca gacgcgcttt ccagggaccg gaggaaacag aacaagatgt agacctctca 
421 gctactccag tgccatccct gcctggaaac tgccatgctt ctcaccatga tgagaatgga 
481 ctgaacctca gaaccatcat tcaagactgt ctgcagctga ttgcagacag caacacgccg 

541 actatacgaa aaggaactta cacatttgtt ccatggcttc tcagctttaa aagaggaaac
601 gccttggagg agaaagagaa caaaatagtg gtgaggcaaa caggctattt cttcatctac 
661 agccaggttc tgtacacgga ccccatcttt gccatgggtc atgtcatcca gaggaagaaa 
721 atacacgtgt ttggggatga gctgagcctg gtgactctgt tccgatgcat ccagaatatg 
781 ccgaaaacac tgcccaacaa ttcctgctac tccgctggca tcgcgaagct ggaagaaggg 

841 gacgagattc agcttgcaat acctcgggag aatgcccaga tttcacggaa cggagacgac
901 accttctttg gtgccctaaa actgctgtga

SEQ ID NO: 249 

Amino acid sequence of rat TNFSF13B encoded by the DNA sequence shown in SEQ ID 
NO: 248. 

MDKSAKNLPPPRLCFCPEKGEDMKVGYVPITPQKEGAWVGICRDRRLLAATLLLALLSSS
FTAMSLYQLAVLQADLMSLRMELQSYRSSATPAAPGAPGLSAGVKLPTPAAPGPHNSSRG
QRNRRAFQGPEETEQDVDLSATPVPSLPGNCHASHHDENGLNLRTIIQDCLQLIADSNTP
TIRKGTYTFVPWLLSFKRGNALEEKENKIVVRQTGYFFIYSQVLYTDPIFAMGHVIQRKK
IHVFGDELSLVTLFRCIQNMPKTLPNNSCYSAGIAKLEEGDEIQLAIPRENAQISRNGDD
TFFGALKLL

SEQ ID NO: 250

gi|25952143|ref|NM_003807.2| Homo sapiens tumor necrosis factor (ligand) superfamily, 
member 14 (TNFSF14), transcript variant 1, mRNA

1 cgagactcca tctcaaaaac aaaacaaata aacgaacaaa aaaacccaca acgtattatt 
61 ttcttgttta cgaggtttct tgtctctctg gctccaccag aagaggagca gggacccttc 
121 ttgctgttgt tcattgctgc atcccccaca ccgagagcag agcctggcat gggcagaaag
181 tcctcagtcg atatttggtg gccccaagcg aatgaagcat ccaagaaggg aaagctgggg 
241 gctccccact gcacttgcca cctgagtcac attttcagaa gcctctggaa agtcgtgcac 
301 agcccaggag tgttgagcaa tttcggtttc ctctgaggtt gaaggaccca ggcgtgtcag 
361 ccctgctcca gacaccttgg gcatggagga gagtgtcgta cggccctcag tgtttgtggt 
421 ggatggacag accgacatcc cattcacgag gctgggacga agccaccgga gacagtcgtg
481 cagtgtggcc cgggtgggtc tgggtctctt gctgttgctg atgggggccg ggctggccgt 
541 ccaaggctgg ttcctcctgc agctgcactg gcgtctagga gagatggtca cccgcctgcc 
601 tgacggacct gcaggctcct gggagcagct gatacaagag cgaaggtctc acgaggtcaa 
661 cccagcagcg catctcacag gggccaactc cagcttgacc ggcagcgggg ggccgctgtt 
721 atgggagact cagctgggcc tggccttcct gaggggcctc agctaccacg atggggccct
781 tgtggtcacc aaagctggct actactacat ctactccaag gtgcagctgg gcggtgtggg
841 ctgcccgctg ggcctggcca gcaccatcac ccacggcctc tacaagcgca caccccgcta 
901 ccccgaggag ctggagctgt tggtcagcca gcagtcaccc tgcggacggg ccaccagcag 
961 ctcccgggtc tggtgggaca gcagcttcct gggtggtgtg gtacacctgg aggctgggga 
1021 gaaggtggtc gtccgtgtgc tggatgaacg cctggttcga ctgcgtgatg gtacccggtc
1081 ttacttcggg gctttcatgg tgtgaaggaa ggagcgtggt gcattggaca tgggtctgac 
1141 acgtggagaa ctcagagggt gcctcagggg aaagaaaact cacgaagcag aggctgggcg 
1201 tggtggctct cgcctgtaat cccagcactt tgggaggcca aggcaggcgg atcacctgag 
1261 gtcaggagtt cgagaccagc ctggctaaca tggcaaaacc ccatctctac taaaaataca 
1321 aaaattagcc ggacgtggtg gtgcctgcct gtaatccagc tactcaggag gctgaggcag
1381 gataattttg cttaaacccg ggaggcggag gttgcagtga gccgagatca caccactgca 
1441 ctccaacctg ggaaacgcag tgagactgtg cctcaaaaaa aaaaaaaaaa a 



SEQ ID NO: 251 

Amino acid sequence of human TNFSF14 encoded by the DNA sequence shown in SEQ ID 
NO: 250.

MEESVVRPSVFVVDGQTDIPFTRLGRSHRRQSCSVARVGLGLLLLLMGAGLAVQGWFLLQ
LHWRLGEMVTRLPDGPAGSWEQLIQERRSHEVNPAAHLTGANSSLTGSGGPLLWETQLGL
AFLRGLSYHDGALVVTKAGYYYIYSKVQLGGVGCPLGLASTITHGLYKRTPRYPEELELL
VSQQSPCGRATSSSRVWWDSSFLGGVVHLEAGEKVVVRVLDERLVRLRDGTRSYFGAFMV

SEQ ID NO: 252 

Amino acid sequence of human TNFSF14, a soluble active secreted form derived from SEQ 
ID NO: 251.

QERRSHEVNPAAHLTGANSSLTGSGGPLLWETQLGLAFLRGLSYHDGALVVTKAGYYYIY
SKVQLGGVGCPLGLASTITHGLYKRTPRYPEELELLVSQQSPCGRATSSSRVWWDSSFLG
GVVHLEAGEKVVVRVLDERLVRLRDGTRSYFGAFMV

SEQ ID NO: 253 

gi|17390119|gb|BC018058.1| Homo sapiens tumor necrosis factor (ligand) superfamily, 
member 14, mRNA (cDNA clone MGC:26477 IMAGE:4793299), complete cds

1 ggtgcacagc ccaggagtgt tgagcaattt cggtttcctc tgaggttgaa ggacccaggc

61 gtgtcagccc tgctccagac accttgggca tggaggagag tgtcgtacgg ccctcagtgt
121 ttgtggtgga tggacagacc gacatcccat tcacgaggct gggacgaagc caccggagac
181 agtcgtgcag tgtggcccgg gtgggtctgg gtctcttgct gttgctgatg ggggccgggc 
241 tggccgtcca aggctggttc ctcctgcagc tgcactggcg tctaggagag atggtcaccc 
301 gcctgcctga cggacctgca ggctcctggg agcagctgat acaagagcga aggtctcacg 
361 aggtcaaccc agcagcgcat ctcacagggg ccaaccccag cttgaccggc agcggggggc 
421 cgctgttatg ggagactcag ctgggcgtgg ccttcctgag gggcctcagc taccacgatg 
481 gggcccttgt ggtcaccaaa gctggctact actacatcta ctccaaggtg cagctgggcg 
541 gtgtgggctg cccgctgggc ctggccagca ccatcaccca cggcctctac aagcgcacac 
601 cccgctaccc cgaggagctg tagctgttgg tcagccagca gtcaccctgc ggacgggcca 
661 ccagcagctc ccgggtctgg tgggacagca gcttcctggg tggtgtggta cacctggagg 
721 ctggggagga ggtggtcgtc cgtgtgctgg atgaacgcct ggttcgactg cgtgatggta 
781 cccggtctta cttcggggct ttcatggtgt gaaggaagga gcgtggtgca ttggacatgg 
841 gtctgacacg tggagaactc agagggtgcc tcaggggaaa gaaaactcac gaagcagagg
901 ctgggcgtgg tggctctcgc ctgtaatccc agcactttgg gaggccaagg caggcggatc 
961 acctgaggtc aggagttcga gaccagcctg gctaacatgg caaaacccca tctctactaa 
1021 aaatacaaaa attagccgga cgtggtggtg cctgcctgta atccagctac tcaggaggct 
1081 gaggcaggat aattttgctt aaacccggga ggcggaggtt gcagtgagcc gagatcacac 
1141 cactgcactc caacctggga aacgcagtga gactgtgcct caaaaaaaaa gaaagaaaaa 
1201 aaaaaaaaaa a 

SEQ ID NO: 254

Amino acid sequence of human TNFSF14 variant ORF number 1 encoded by the DNA 
sequence shown in SEQ ID NO: 253. 

MEESVVRPSVFVVDGQTDIPFTRLGRSHRRQSCSVARVGLGLLLLLMGAGLAVQGWFLLQ
LHWRLGEMVTRLPDGPAGSWEQLIQERRSHEVNPAAHLTGANPSLTGSGGPLLWETQLGV
AFLRGLSYHDGALVVTKAGYYYIYSKVQLGGVGCPLGLASTITHGLYKRTPRYPEEL

SEQ ID NO: 255 

gi|25952146|ref|NM_172014.1| Homo sapiens tumor necrosis factor (ligand) superfamily, 
member 14 (TNFSF14), transcript variant 2, mRNA 

1 cgagactcca tctcaaaaac aaaacaaata aacgaacaaa aaaacccaca acgtattatt 
61 ttcttgttta cgaggcttct tgtctctctg gctccaccag aagaggagca gggacccttc 
121 ttgctgttgt tcattgctgc atcccccaca ccgagagcag agcctggcat gggcagaaag 
181 tcctcagtcg atatttggtg gccccaagcg aatgaagcat ccaagaaggg aaagctgggg 
241 gctccccact gcacttgcca cctgagtcac attttcagaa gcctctggaa agtcgtgcac 

301 agcccaggag tgttgagcaa tttcggtttc ctctgaggtt gaaggaccca ggcgtgtcag
361 ccctgctcca gacaccttgg gcatggagga gagtgtcgta cggccctcag tgtttgtggt 
421 ggatggacag accgacatcc cattcacgag gctgggacga agccaccgga gacagtcgtg 

481 cagtgtggcc cgggacggac ctgcaggctc ctgggagcag ctgatacaag agcgaaggtc
541 tcacgaggtc aacccagcag cgcatctcac aggggccaac tccagcttga ccggcagcgg 
601 ggggccgctg ttatgggaga ctcagctggg cctggccttc ctgaggggcc tcagctacca 
661 cgatggggcc cttgtggtca ccaaagctgg ctactactac atctactcca aggtgcagct 
721 gggcggtgtg ggctgcccgc tgggcctggc cagcaccatc acccacggcc tctacaagcg 
781 cacaccccgc taccccgagg agctggagct gttggtcagc cagcagtcac cctgcggacg 
841 ggccaccagc agctcccggg tctggtggga cagcagcttc ctgggtggtg tggtacacct 
901 ggaggctggg gaggaggtgg tcgtccgtgt gctggatgaa cgcctggttc gactgcgtga 
961 tggtacccgg tcttacttcg gggctttcat ggtgtgaagg aaggagcgtg gtgcattgga 

1021 catgggtctg acacgtggag aactcagagg gtgcctcagg ggaaagaaaa ctcacgaagc 
1081 agaggctggg cgtggtggct ctcgcctgta atcccagcac tttgggaggc caaggcaggc 
1141 ggatcacctg aggtcaggag ttcgagacca gcctggctaa catggcaaaa ccccatctct 
1201 actaaaaata caaaaattag ccggacgtgg tggtgcctgc ctgtaatcca gctactcagg 
1261 aggctgaggc aggataattt tgcttaaacc cgggaggcgg aggttgcagt gagccgagat 
1321 cacaccactg cactccaacc tgggaaacgc agtgagactg tgcctcaaaa aaaaaaaaaa 
1381 aaaaaaaaaa aaaaa

SEQ ID NO: 256

Amino acid sequence of human TNFSF14 variant ORF number 2 encoded by the DNA 
sequence shown in SEQ ID NO: 255. 

MEESVVRPSVFVVDGQTDIPFTRLGRSHRRQSCSVARDGPAGSWEQLIQERRSHEVNPAA
HLTGANSSLTGSGGPLLWETQLGLAFLRGLSYHDGALVVTKAGYYYIYSKVQLGGVGCPL
GLASTITHGLYKRTPRYPEELELLVSQQSPCGRATSSSRVWWDSSFLGGVVHLEAGEEVV
VRVLDERLVRLRDGTRSYFGAFMV

SEQ ID NO: 257 

gi|9507194|ref|NM_019418.1| Mus musculus tumor necrosis factor (ligand) superfamily, 
member 14 (Tnfsf14), mRNA

1 gcccaacacg ctcgggcagt ttgcacagcc cgagcgtgtt gggcaattgt ggtttcctcc 
61 ggagaggagg aactcaggct tgccaaccct ttccctgggc ttcggagcct cagctgctct 
121 ggcatggaga gtgtggtaca gccttcagtg tttgtggtgg atggacagac ggacatccca 
181 ttcaggcggc tggaacagaa ccaccggaga cggcgctgtg gcactgtcca ggtcagcctg 
241 gccctggtgc tgctgctagg tgctgggctg gccactcagg gctggtttct cctgagactg 
301 catcaacgtc ttggagacat agtagctcat ctgccagatg gaggcaaagg ctcctgggag 
361 aagctgatac aagatcaacg atctcaccag gccaacccag cagcacatct tacaggagcc 
421 aacgccagct tgataggtat tggtggacct ctgttatggg agacacgact tggcctggcc
481 ttcttgaggg gcttgacgta tcatgatggg gccctggtga ccatggagcc cggttactac 
541 tatgtgtact ccaaagtgca gctgagcggc gtgggctgcc cccaggggct ggccaatggc 
601 ctccccatca cccatggact atacaagcgc acatcccgct acccgaagga gttagaactg 
661 ctggtcagtc ggcggtcacc ctgtggccgg gccaacagct cccgagtctg gtgggacagc 
721 agcttcctgg gcggcgtggt acatctggag gctggggaag aggtggtggt ccgcgtgcct 
781 ggaaaccgcc tggtcagacc acgtgacggc accaggtcct atttcggagc tttcatggtc 
841 tgaaggctgc ggtgacaatg tattttgtgg agggacctct ccaggactca cctcaaaccc 
901 agcaataggg tttgaagtcc tccctttaag gagccctgaa ctctgcagtg ctcggggcgg 
961 tgtagactgc tgacctgctt tgggcaatct tcaaatcaga gacctggaga cttggggcgt 
1021 ggagcccagg agcgaggggt cagctcattt gcctgatatt caggaagaaa gaatcaagct 
1081 ggggtattta tgcttctgat gcaaacactg agatttcggc tttctgggtt ttgagctgga 
1141 ggcaagaaac cttcccagag tgtcatcagg accatgttgg caggacttgg ggctccagac 
1201 ttgccaccac actctggcct ctcccatcca tccgctgcat tggtttccag ccaccaaaac 
1261 agcactggcc ccctggctgc aactggccag gtacgagctt ctgagcacct acattcctca 
1321 gggacatctt gatgagatct cagtactcag tccaatgcgc agcagcgaca gacatgccag 
1381 gaatggttgg tcagaaggga agggaggaaa gggaggaaag aagggaatgc agaagagaag 
1441 gggggaaaac aagaccaaaa caaaacagca acaacaaagc ggcagggagg aggtgacacc 
1501 cttggggata ctttagtcaa cacacttaga acagattgtg ccaggcctgt tggattcctg 
1561 gagttgatgg gatcgtggga aggcacaatg gggagcaagt gggcttgggt tatggctcag 
1621 tgggtaaagt gcaattatgg ggatctgagt ttgaatccct ggtacccata taaagacaca 
1681 gatgcggtga tgggcacttg tgacaatgag atcatcaata gggaatggag acaggaggga 
1741 cctctggggt tcactggcca ggcagtctag ctgaatcaaa gagctccaag ttcagtcgat 
1801 agctcctgaa gatgacaact gaggctattc tccaaacccc acacgcagga cacatgcgta 
1861 at 

SEQ ID NO: 258 

Amino acid sequence of mouse TNFSF14 encoded by the DNA sequence shown in SEQ ID 
NO: 257. 

MESVVQPSVFVVDGQTDIPFRRLEQNHRRRRCGTVQVSLALVLLLGAGLATQGWFLLRLH
QRLGDIVAHLPDGGKGSWEKLIQDQRSHQANPAAHLTGANASLIGIGGPLLWETRLGLAF
LRGLTYHDGALVTMEPGYYYVYSKVQLSGVGCPQGLANGLPITHGLYKRTSRYPKELELL
VSRRSPCGRANSSRVWWDSSFLGGVVHLEAGEEVVVRVPGNRLVRPRDGTRSYFGAFMV

SEQ ID NO: 259

gi|27672737|ref|XM_236794.1| Rattus norvegicus similar to LIGHT protein (LOC301133), 
mRNA

1 atggagagtg tggtacagcc ttcagtgttt gtggtggatg gacagacaga catcccattc 
61 aggcggctgg gacagaacca caggagacgg cactgcggca ctgtccaggt cagcctggcc
121 ctgctgctgc tgctgggtgc tgggctggcc actgagggct ggtttctcct gagactgcat 
181 cagcgtcttg gggacatagt agctcatctg ccagatggag gcaaaggctc ctgggagaag 
241 ctgatacaag gagctaacgc cagcttgata ggcattggtg gacctctgtt atgggagaca 
301 caacttggcc tggccttcct gaggggcctg acgtatcatg atggggccct ggtgaccacc 

361 gaggctggct actactacgt gtactccaaa gtgcagttga gtggtgtggg ctgcccccag
421 gggctggcca atggcctccc catcacccac gggctgtaca agcgcacatc ccgatacccc 
481 aaggagttag aactgctggt cagccggcgg tcaccttgtg gccgggccaa cagctcccga 
541 gtctggtggg acagtagttt cctcggcgga gtggtacatc tggaggccgg agaagaggtg 
601 gtggtccgcg tgcctggaaa ccgcctggtc agaccacgtg atggcacgag gtcctatttc 

661 ggagctttca tgatctga

SEQ ID NO: 260 

Amino acid sequence of rat TNFSF14 encoded by the DNA sequence shown in SEQ ID NO: 
259.


MESVVQPSVFVVDGQTDIPFRRLGQNHRRRHCGTVQVSLALLLLLGAGLATEGWFLLRLH
QRLGDIVAHLPDGGKGSWEKLIQGANASLIGIGGPLLWETQLGLAFLRGLTYHDGALVTT
EAGYYYVYSKVQLSGVGCPQGLANGLPITHGLYKRTSRYPKELELLVSRRSPCGRANSSR
VWWDSSFLGGVVHLEAGEEVVVRVPGNRLVRPRDGTRSYFGAFMI

SEQ ID NO: 261 

gi|13775596|ref|NM_024164.2| Homo sapiens tryptase beta 2 (TPSB2), mRNA

1 ggccaggatg ctgaatctgc tgctgctggc gctgcccgtc ctggcgagcc gcgcctacgc

61 ggcccctgcc ccaggccagg ccctgcagcg agtgggcatc gttgggggtc aggaggcccc 
121 caggagcaag tggccctggc aggtgagcct gagagtccac ggcccatact ggatgcactt 
181 ctgcgggggc tccctcatcc acccccagtg ggtgctgacc gcagcgcact gcgtgggacc 
241 ggacgtcaag gatctggccg ccctcagggt gcaactgcgg gagcagcacc tctactacca 
301 ggaccagctg ctgccggtca gcaggatcat cgtgcaccca cagttctaca ccgcccagat
361 cggagcggac atcgccctgc tggagctgga ggagccggtg aaggtctcca gccacgtcca 
421 cacggtcacc ctgccgcctg cctcagagac cttccccccg gggatgccgt gctgggtcac
481 tggctggggc gatgtggaca atgatgagcg cctcccaccg ccatttcctc tgaagcaggt 
541 gaaggtcccc ataatggaaa accacatttg tgacgcaaaa taccaccttg gcgcctacac 
601 gggagacgac gtccgcatcg tccgtgacga catgctgtgt gccgggaaca cccggaggga
661 ctcatgccag ggcgactccg gagggcccct ggtgtgcaag gtgaatggca cctggctgca 
721 ggcgggcgtg gtcagctggg gcgagggctg tgcccagccc aaccggcctg gcatctacac 
781 ccgtgtcacc tactacttgg actggatcca ccactatgtc cccaaaaagc cgtgagtcag 
841 gcctgggttg gccacctggg tcactggagg accaacccct gctgtccaaa acaccactgc 
901 ttcctaccca ggtggcgact gccccccaca ccttccctgc cccgtcctga gtgccccttc 
961 ctgtcctaag ccccctgctc tcttctgagc cccttcccct gtcctgagga cccttcccca 
1021 tcctgagccc ccttccctgt cctaagcctg acgcctgcac cgggccctcc ggccctcccc
1081 tgcccaggca gctggtggtg ggcgctaatc ctcctgagtg ctggacctca ttaaagtgca 
1141 tggaaacc

SEQ ID NO: 262

Amino acid sequence of human TPSB2 encoded by the DNA sequence shown in SEQ ID 
NO: 261.

MLNLLLLALPVLASRAYAAPAPGQALQRVGIVGGQEAPRSKWPWQVSLRVHGPYWMHFCG
GSLIHPQWVLTAAHCVGPDVKDLAALRVQLREQHLYYQDQLLPVSRIIVHPQFYTAQIGA
DIALLELEEPVKVSSHVHTVTLPPASETFPPGMPCWVTGWGDVDNDERLPPPFPLKQVKV
PIMENHICDAKYHLGAYTGDDVRIVRDDMLCAGNTRRDSCQGDSGGPLVCKVNGTWLQAG
VVSWGEGCAQPNRPGIYTRVTYYLDWIHHYVPKKP

SEQ ID NO: 263 

Amino acid sequence of human TPSB2, a soluble active secreted form derived from SEQ ID 
NO: 262.

IVGGQEAPRSKWPWQVSLRVHGPYWMHFCGGSLIHPQWVLTAAHCVGPDVKDLAALRVQL
REQHLYYQDQLLPVSRIIVHPQFYTAQIGADIALLELEEPVKVSSHVHTVTLPPASETFP
PGMPCWVTGWGDVDNDERLPPPFPLKQVKVPIMENHICDAKYHLGAYTGDDVRIVRDDML
CAGNTRRDSCQGDSGGPLVCKVNGTWLQAGVVSWGEGCAQPNRPGIYTRVTYYLDWIHHY
VPKKP

SEQ ID NO: 264 

gi|6857813|ref|NM_010781.1| Mus musculus mast cell protease 6 (Mcpt6), mRNA

1 ctaagatgct gaagcggcgg ctgctgctgc tgtgggcact gtccctcctg gctagtctgg 
61 tgtactcagc ccctcgccca gccaatcagc gagtgggcat cgtgggagga catgaggctt 

121 ctgagagtaa gtggccctgg caggtgagcc tgagatttaa attaaactac tggatacatt
181 tctgcggagg ctctctcatc cacccacagt gggtgctcac tgcggcacac tgtgtgggac 
241 cgcacatcaa aagcccacag ctcttccggg tgcagcttcg tgagcagtat ctatactatg 
301 gggaccagct cctctctttg aaccggatcg tggtgcaccc ccactattac acggccgagg 
361 gtggggcaga cgttgccctg ctggagcttg aggtccctgt gaatgtctcc acccatatcc 

421 accccatatc cctgccccct gcctcggaga ccttcccccc tgggacatcg tgctgggtga
481 caggctgggg cgacattgat aatgacgagc ctctcccacc tccttatcct ctgaagcaag
541 tgaaggttcc cattgtggaa aacagcctgt gtgaccggaa gtaccacact ggcctctaca 
601 cgggagatga ttttcccatt gtccatgatg gcatgctgtg tgctggaaat accaggagag 
661 actcctgcca gccattctgt attggagatg acatctgatg agggtgacat ctacatttta 

721 caattactgc tgagtgctga cctttccctc tcttggatgg aggcctcatc cttgatggga
781 aattaggtca gttgtcccac ccagtgctag gtgtcatctc tggctctctg tgtgtgtcac 
841 actcttcacc gctgaaccat ctctcgagtt ccttagtttt catttttaat gtcaaacata 
901 acagactcat ccatcacaaa aataaaaggt gaatgtaaaa aaaa 

SEQ ID NO: 265 

Amino acid sequence of mouse TPSB2 encoded by the DNA sequence shown in SEQ ID 
NO: 264.

MLKRRLLLLWALSLLASLVYSAPRPANQRVGIVGGHEASESKWPWQVSLRFKLNYWIHFC
GGSLIHPQWVLTAAHCVGPHIKSPQLFRVQLREQYLYYGDQLLSLNRIVVHPHYYTAEGG
ADVALLELEVPVNVSTHIHPISLPPASETFPPGTSCWVTGWGDIDNDEPLPPPYPLKQVK
VPIVENSLCDRKYHTGLYTGDDFPIVHDGMLCAGNTRRDSCQPFCIGDDI

SEQ ID NO: 266 

gi|9506886|ref|NM_019180.1| Rattus norvegicus mast cell protease 6 (Mcpt6), mRNA

1 ggagagagga gccgagacag ccaagatgct gaagctgctg ctgctgctgg cactgtcccc
61 cctggctagt ctggtgcacg cggccccttg cccagtcaag cagcgagtgg gcattgtggg 
121 aggacgagag gcttctgaaa gtaagtggcc ctggcaggtg agcctgagat ttaaattcag
181 cttctggatg catttctgtg gcggctccct cattcaccca cagtgggtgc tcactgcggc 
241 acactgtgtg ggactgcaca tcaaaagccc agagctcttc cgtgtacagc ttcgtgagca
301 gtatctatac tatgcggacc agctactgac tgtgaaccgg accgttgtgc acccccacta 
361 ctacacagtc gaggatgggg cagacattgc cctgctggag cttgagatcc ctgtgaatgt 
421 ctccacccat atccacccca tatccctgcc ccctgcctcg gagaccttcc cctcggggac 
481 ttcttgctgg gtaacaggct ggggcgacat tgatagtgac gagcctctcc tgccacctta 
541 tcctctgaag caagtgaagg tccccattgt ggaaaacagc ctgtgtgatc ggaagtacca
601 cactggcctc tacacaggag atgatgttcc cattgtccag gatggcatgc tgtgtgctgg 
661 aaataccagg agcgactcct gccagggaga ctcagggggc ccactggtct gcaaagtgaa 
721 gggtacctgg ctgcaagcag gagtggtcag ctggggtgag ggctgcgcag aggccaatcg 
781 tcctggcatt tacacccggg tgacgtacta cctggactgg attcaccgct atgtccctca 
841 gcgttcctga gacccatcca gggtcaggga agaaccaggc acctgctgtc tttaactcac
901 tgcttcctgg ccagatggaa ccctggcctt ctttgtactc tgtctcccct gtctaccggg 
961 tgtccctctg agcccccact ttgttccacc ttgagtccct cgccactcct gtcccctctg 
1021 cctcccacca caacacagct gcactgtgcg gctccctctt ttctgtggct cattaaagta 
1081 tgtgaaaatt ttgctcc 

SEQ ID NO: 267

Amino acid sequence of rat TPSB2 encoded by the DNA sequence shown in SEQ ID NO: 
266. 

MLKLLLLLALSPLASLVHAAPCPVKQRVGIVGGREASESKWPWQVSLRFKFSFWMHFCGG
SLIHPQWVLTAAHCVGLHIKSPELFRVQLREQYLYYADQLLTVNRTVVHPHYYTVEDGAD
IALLELEIPVNVSTHIHPISLPPASETFPSGTSCWVTGWGDIDSDEPLLPPYPLKQVKVP
IVENSLCDRKYHTGLYTGDDVPIVQDGMLCAGNTRSDSCQGDSGGPLVCKVKGTWLQAGV
VSWGEGCAEANRPGIYTRVTYYLDWIHRYVPQRS

SEQ ID NO: 268 

gi|18491001|ref|NM_003881.2| Homo sapiens WNT1 inducible signaling pathway protein 2 
(WISP2), mRNA

1 tgtgtgtgtg tgtgtgtgag cgcgcgcgcg cgcgcgcgtg tgtactcgtg cgtgtgcctg 
61 tgtgtgcctg ggagtgacct cacagctgcc ggaacataaa gactcacagg tccgcctccc 
121 aggctcaaag ctggctctgc aggggacatg agaggcacac cgaagaccca cctcctggcc 
181 ttctccctcc tctgcctcct ctcaaaggtg cgtacccagc tgtgcccgac accatgtacc 
241 tgcccctggc cacctccccg atgcccgctg ggagtacccc tggtgctgga tggctgtggc
301 tgctgccggg tatgtgcacg gcggctgggg gagccctgcg accaactcca cgtctgcgac
361 gccagccagg gcctggtctg ccagcccggg gcaggacccg gtggccgggg ggccctgtgc 
421 ctcttggcag aggacgacag cagctgtgag gtgaacggcc gcctgtatcg ggaaggggag 
481 accttccagc cccactgcag catccgctgc cgctgcgagg acggcggctt cacctgcgtg 
541 ccgctgtgca gcgaggatgt gcggctgccc agctgggact gcccccaccc caggagggtc 
601 gaggtcctgg gcaagtgctg ccctgagtgg gtgtgcggcc aaggaggggg actggggacc 
661 cagccccttc cagcccaagg accccagttt tctggccttg tctcttccct gccccctggt 
721 gtcccctgcc cagaatggag cacggcctgg ggaccctgct cgaccacctg tgggctgggc 
781 atggccaccc gggtgtccaa ccagaaccgc ttctgccgac tggagaccca gcgccgcctg
841 tgcctgtcca ggccctgccc accctccagg ggtcgcagtc cacaaaacag tgccttctag 
901 agccgggctg ggaatgggga cacggtgtcc accatcccca gctggtggcc ctgtgcctgg 
961 gccctgggct gatggaagat ggtccgtgcc caggcccttg gctgcaggca acactttagc 
1021 ttgggtccac catgcagaac accaatatta acacgctgcc tggtctgtct ggatcccgag 
1081 gtatggcaga ggtgcaagac ctagtcccct ttcctctaac tcactgccta ggaggctggc 
1141 caaggtgtcc agggtcctct agcccactcc ctgcctacac acacagccta tatcaaacat 
1201 gcacacgggc gagctttctc tccgacttcc cctgggcaag agatgggaca agcagtccct 
1261 taatattgag gctgcagcag gtgctgggct ggactggcca tttttctggg ggtaggatga
1321 agagaaggca cacagagatt ctggatctcc tgctgccttt tctggagttt gtaaaattgt
1381 tcctgaatac aagcctatgc gtgaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 



SEQ ID NO: 269

Amino acid sequence of human WISP2 encoded by the DNA sequence shown in SEQ ID 
NO: 268. 



MRGTPKTHLLAFSLLCLLSKVRTQLCPTPCTCPWPPPRCPLGVPLVLDGCGCCRVCARRL
GEPCDQLHVCDASQGLVCQPGAGPGGRGALCLLAEDDSSCEVNGRLYREGETFQPHCSIR
CRCEDGGFTCVPLCSEDVRLPSWDCPHPRRVEVLGKCCPEWVCGQGGGLGTQPLPAQGPQ
FSGLVSSLPPGVPCPEWSTAWGPCSTTCGLGMATRVSNQNRFCRLETQRRLCLSRPCPPS
RGRSPQNSAF

SEQ ID NO: 270 

Amino acid sequence of human WISP2, a soluble active secreted form derived from SEQ ID 
NO:269. 



RTQLCPTPCTCPWPPPRCPLGVPLVLDGCGCCRVCARRLGEPCDQLHVCDASQGLVCQPG
AGPGGRGALCLLAEDDSSCEVNGRLYREGETFQPHCSIRCRCEDGGFTCVPLCSEDVRLP
SWDCPHPRRVEVLGKCCPEWVCGQGGGLGTQPLPAQGPQFSGLVSSLPPGVPCPEWSTAW
GPCSTTCGLGMATRVSNQNRFCRLETQRRLCLSRPCPPSRGRSPQNSAF

SEQ ID NO: 271 



gi|8394540|ref|NM_016873.1| Mus musculus WNT1 inducible signaling pathway protein 2 
(Wisp2), mRNA

1 cccacgcgtc cgcgctcctg atctccagag gaccccgggc tgggacaggg gccttggcga 
61 ggctgcagct gctgtggcag tagcttggga tggaggtctt tcttgctggg aactgaggag 
121 ctgagaggct cctgtcaggc tcctgtccta aactcttggc acttgcggtg gcttgggctt 
181 cacacactgt cagacacctt cttggtggcc tcctcggcct caggtttgaa gctggctcca 
241 caagggacac ggtgacatga ggggcaaccc actgatccat cttctggcca tttccttcct 
301 ctgcattctc tcaatggtgt attcccagct gtgcccagca ccctgtgcct gtccttggac 
361 accaccccag tgcccaccgg gggtacccct ggtgctggat ggctgtggct gctgtcgagt 
421 gtgtgcacgg aggctggggg agtcctgcga ccacctgcat gtctgcgacc ccagccaggg 
481 cctggtttgt cagcctgggg caggccccag tggccgtggt gctgtgtgcc tcttcgaaga 
541 ggatgacggg agctgtgagg tgaatggccg caggtacctg gatggggaga cctttaaacc 
601 caattgcagg gttttgtgcc gctgtgatga cggtggtttc acctgcctgc cgctgtgcag 
661 tgaggatgtg cggctgccca gctgggactg cccacgcccc aggagaatac aggtgccagg 
721 aaggtgctgc cccgagtggg tgtgtgacca ggcagtgatg cagccggcaa tccagccctc 
781 ctcagcccaa ggacaccaac tttctgccct tgtcactcct gcatctgccg atggcccctg
841 tccaaactgg agcacagcct ggggcccctg ctcaaccacc tgtgggttgg gcatagccac 
901 ccgagtatcc aaccagaacc gattctgcca actggagatc cagcgtcgcc tgtgtctgtc 
961 cagaccctgc ctggcatcca ggagccacgg ctcatggaac agtgccttct agagccattg 
1021 cggggatgtg gatacagggc ctgccattct cagcaaatgt ccctaggacc aggccctgga 
1081 ctgatggtag atgcccctct ccatgctctt ggctgcagtt aactgtcctg ggtggattca 
1141 gtgtccagag cctctgagcg atccctgctc tgtctgaggt gggggaagca ggtgaccagc 
1201 tccatttctc tggattctga cccaggcttc tgggttctcc tggctagttc ctcaaaactt 
1261 ccctgtatga aaaggacaac caaaaggacc tttaaagcta agctgtactg ggcaagcctg
1321 gccaccatgc tggggatagt gacagtaata ggtaccaggc agcagattgc ctgaaacatc 
1381 caggtccctt cttggacttc tatgtgcttg tcccaaagat tatgggtgac cttgtaagtg
1441 tgcctttcct gatctgagaa caccctgccc ggctgggaag aattttctgg gaacatgaag
1501 agatggaatc acactattct taagagcgtt tgccaagtcc aggaacttga cctttgtatt 
1561 tgtaaaaata cacatctctt aaatgctcac aaagcaagag gctccacact tctggcaggc
1621 cagggccttt ctcttcagca tgagagagac aaggaacagt agagtaccct cctctggagg
1681 actggcccgg tctggaataa acacccaaat caagtgtgga aaaaaaaaaa aaaa 



SEQ ID NO: 272 



Amino acid sequence of mouse WISP2 encoded by the DNA sequence shown in SEQ ID 
NO: 271.

MRGNPLIHLLAISFLCILSMVYSQLCPAPCACPWTPPQCPPGVPLVLDGCGCCRVCARRL
GESCDHLHVCDPSQGLVCQPGAGPSGRGAVCLFEEDDGSCEVNGRRYLDGETFKPNCRVL
CRCDDGGFTCLPLCSEDVRLPSWDCPRPRRIQVPGRCCPEWVCDQAVMQPAIQPSSAQGH
QLSALVTPASADGPCPNWSTAWGPCSTTCGLGIATRVSNQNRFCQLEIQRRLCLSRPCLA
SRSHGSWNSAF



SEQ ID NO: 273 

gi|13928801|ref|NM_031590.1| Rattus norvegicus WNT1 inducible signaling pathway 
protein 2 (Wisp2), mRNA 

1 ctgcaaagat ctgacagacg cttctgatct ccagaggacc ctggggtggg acaggggctt

61 ggcaaggctg cagcgctggg cagtggcttg gaatggaggt ctttattact gggaactgag 
121 gagctaagag gctcctgtca gcttgtccta aacccttagc acttgtggtg gcttgggctt 
181 cacacactgt cagacacctt cgtggtggcc tccacggctc acctccaggt ttgaagctgg 
241 ctccacaagg gacacggtga catgaggggc agcccactga tccgtcttct ggccacttcc 
301 ttcctctgcc ttctctcaat ggtgtgtgcc cagctgtgcc ggacaccctg cacctgtccc
361 tggacaccac cccagtgccc acagggggta cccctggtgc tggatggctg tggctgctgt 
421 aaagtgtgtg cacggaggct gacggagtcc tgcgaacacc tgcatgtctg cgaacccagc 
481 cagggcctgg tttgtcagcc tggggcaggc cctggcggcc atggggctgt gtgtctcttg
541 gatgaggatg acggtgactg tgaggtgaat ggccgcaggt acctggatgg agagaccttt 
601 aaacccaatt gcagggtcct gtgccgctgt gatgacggtg gcttcacctg cctcccgctg
661 tgcagtgagg atgtgacgct gcccagctgg gactgcccac gccccaagag aatacaggtg 
721 ccaggaaagt gctgccccga gtgggtatgt gaccagggag tgacaccggc gatccagcgc 
781 tccgcggcgc aaggacacca actttctgcc cttgtcactc ctgcctctgc tgatgctcct 
841 tggccaaatt ggagcacagc ctggggcccc tgctcaacca cctgtgggct gggcatagcc 
901 acccgagtgt ccaaccagaa ccgattctgc caactggaga tccaacgccg cctgtgtctg
961 cccagaccct gcctggcagc caggagccac agctcatgga acagtgcttt ctaaggtcaa 
1021 ctggggatgc ggatacaggg cctgccatcc tcagcaaatg acccttggac caggccctgg 
1081 actgctggta gatgctcttc tccatgctct tggctgcagt taactgtcct gcttggattc 
1141 actgtgtaga gccactgagc gatccctgct ctgtctgagg taggcggagc aggtgaccag 
1201 ctccagttct ctggttcagc ctggaattct gggttctcct ggctcattcc tcaaaacatc
1261 cctgtacaaa aaggacaacc aaaaagacct ttaaacctag gctatactgg gcaaacctgg 
1321 ccaccgtgct ggggataagg tcaatgttag gaccagacag cagattgcct gaaacttcca 
1381 attcccttct tggacttctg tatgcttgta accaaagatg atgaatgaac tcgtaagtgt 
1441 accttccctg acctgagaac accctgcctg ctcgggaagt attcaggggc agaattctct 
1501 gtgaacatga agagatgaat cacactgtcc ttaagaaatt cctcaaagtc caggagcttg
1561 agctttgtat tttcaggaat gcacatctct taagcactcg caaaacagga aggctccaca 
1621 cctctaacag ccagggcctt tctcttcagc atgagaaaga caagggacag cagagtactc
1681 tcgtctggag gactagtcta gcctagaata aacacccaaa tcaagcgtga aaaaaaaaaa
1741 a 

SEQ ID NO: 274

Amino acid sequence of rat WISP2 encoded by the DNA sequence shown in SEQ ID NO: 
273. 

MRGSPLIRLLATSFLCLLSMVCAQLCR^
TESCEHLHVCEPSQGLVCQPGAGPGGHGAVCLLDEDDGDCEVN^
CRCDDGGFTCLPLCSEDVTLPSWDCPRPKRIQVPGKCCPEWVCDQGVTPAIQRSAAQGHQ
LSALVTPASADAPWPNWSTAWGPCSTTCGLGIATRVSNQNRFCQLEIQRRLCLPRPCLAA
RSHSSWNSAF
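
The relationship stated above, that SEQ ID NO: 274 is the amino acid sequence encoded by the DNA of SEQ ID NO: 273, can be checked by conceptual translation. The following is a minimal sketch, not part of the original disclosure. It assumes the standard genetic code, assumes the open reading frame begins at the first ATG in the supplied text, and assumes the listing is free of character-recognition errors (any character other than a, c, g or t is silently discarded, which would shift the reading frame if a base had been misread). The function names are hypothetical.

```python
import re
from itertools import product

# Standard genetic code, built in t, c, a, g order
# (first base varies slowest, third base fastest).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_listing(text):
    """Drop position numbers, spaces and any other non-base characters."""
    return re.sub(r"[^acgt]", "", text.lower())


def translate_first_orf(dna):
    """Translate from the first ATG up to the first in-frame stop codon.

    If the input ends before a stop codon is reached, the partial
    translation obtained so far is returned.
    """
    start = dna.find("atg")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# Row 241 of the listing above, which contains the start codon.
row_241 = "241 ctccacaagg gacacggtga catgaggggc agcccactga tccgtcttct ggccacttcc"
n_terminus = translate_first_orf(clean_listing(row_241))
print(n_terminus)
```

Applied to row 241 alone, the sketch yields the first thirteen residues of the protein, which agree with the start of the amino acid sequence shown above. Applying it to the complete listing would need the full text of SEQ ID NO: 273, including the rows that precede position 121.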
WHAT IS CLAIMED IS:

1. A method for identifying an agent for treating an obese, diabetic or pre-diabetic individual, the method comprising the steps of:

(i) contacting an agent to a polypeptide encoded by a polynucleotide that hybridizes to a nucleic acid encoding SEQ ID NO: 213, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 196, 197, 199, 201, 203, 205, 207, 209, 211, 215, 217, 219, 221, 223, 225, 227, 229, 230, 232, 234, 236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 267, 269, 270, 272 or 274 in 50% formamide, 5X SSC, and 1% SDS at 42°C followed by a wash in 0.2X SSC, and 0.1% SDS at 55°C; and

(ii) selecting an agent that modulates the expression or activity of the polypeptide or that binds to the polypeptide, thereby identifying an agent for treating an obese, diabetic or pre-diabetic individual.

2. The method of claim 1, wherein the polypeptide comprises an amino acid sequence at least 95% identical to SEQ ID NO: 213, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 196, 197, 199, 201, 203, 205, 207, 209, 211, 215, 217, 219, 221, 223, 225, 227, 229, 230, 232, 234, 236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 267, 269, 270, 272, 274 or a protein domain thereof.



3. The method of claim 1, the method further comprising detecting whether the selected agent modulates weight and/or obesity.

4. The method of claim 1, the method further comprising detecting whether the selected agent modulates insulin sensitivity.
5. The method of claim 1, wherein step (ii) comprises selecting an agent that modulates expression of the polypeptide.

6. The method of claim 1, wherein step (ii) comprises selecting an agent that modulates the activity of the polypeptide.

7. The method of claim 1, wherein step (ii) comprises selecting an agent that specifically binds to the polypeptide.

8. The method of claim 1, wherein the polypeptide is expressed in a cell and the cell is contacted with the agent.

9. The method of claim 1, wherein the polypeptide comprises SEQ ID NO: 213, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 33, 35, 37, 39, 40, 42, 44, 46, 47, 49, 51, 53, 55, 57, 59, 61, 63, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 182, 184, 186, 188, 190, 192, 194, 196, 197, 199, 201, 203, 205, 207, 209, 211, 215, 217, 219, 221, 223, 225, 227, 229, 230, 232, 234, 236, 238, 240, 242, 243, 245, 247, 249, 251, 252, 254, 256, 258, 260, 262, 263, 265, 267, 269, 270, 272 or 274.
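
By way of illustration only, and not as part of the claims, the "at least 95% identical" criterion recited in claim 2 can be sketched as a simple calculation. The sketch assumes the two sequences have already been aligned to equal length by a separate alignment tool, with "-" marking gaps, and it takes percent identity to be the number of identical residue columns divided by the total number of alignment columns. Other definitions of percent identity exist, and this sketch does not assert which one governs the claims. The function names and the variant sequences are hypothetical; the reference fragment is taken from the amino acid sequence of SEQ ID NO: 274 shown above.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns holding the same residue in both sequences.

    Assumes both inputs come from one pairwise alignment, so they have the
    same length. A column containing a gap never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap)
    return 100.0 * matches / len(aligned_a)


def meets_identity_threshold(aligned_a, aligned_b, threshold=95.0):
    """True when percent identity is at least the threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# A 20-residue fragment of SEQ ID NO: 274 and two hypothetical variants.
reference = "LSALVTPASADAPWPNWSTA"
one_change = "LSALVTPASADAPWPNWSTG"   # 19 of 20 columns identical
two_changes = "LSALVTPASADAPWPNWSGG"  # 18 of 20 columns identical

print(percent_identity(reference, one_change))
print(meets_identity_threshold(reference, two_changes))
```

Under these assumptions, a single substitution in a 20-residue alignment sits exactly on the 95% boundary and is accepted, while two substitutions fall below it.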